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DMA SEQUENCES CODING FOR MODIFIED 
FACTOR VIII :C AND MODIFIED FACTOR 
VIII:C-LIKE POLYPEPTIDES, AND PROCESSES FOR 
5 PRODUCING THESE POLYPEPTIDES IN HIGH YIELDS 

TECHNICAL FIELD OF THE INVENTION 

This invention relates to DNA sequences 
coding for modified factor VlllrC-like polypeptides 
and processes for producing them using those DNA 

10 sequences. More particularly, this invention relates 
to the production of modified factor VIII :C and 
modified factor VIII:C-liJce polypeptides which 
display the biological activity of factor VIII :C. 
In addition, the polypeptides of this invention are 

15 produced in higher yields than previously produced 
factor VIII:C-like polypeptides and are more easily 
purified into biochemically pure mature factor 
VlllrC. 

BACKGROUND OF THE INVENTION 

20 Factor VIII :C, a large plasma glycoprotein, 

functions as the procoagulant component of factor VIII, 
which plays an integral role in the cascade mechanism 
of blood coagulation [see generally, W. J . Williams 
et al*. Hematology , pp. 1085-90, McGraw-Hill, New York 

25 (1972)]. Factor VIII:C circulates in the blood as a 
complex with factor VIIIRrAg (also known as 
von Willebrand factor protein) which is a large 
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protein associated with platelet aggregation and 
adhesive properties^ 

Factor VIII:C is synthesized as a single 
chain macromolecnleo: precursor, which is later cleaved 
5 to yield the fragments which constitute '^mature" 

factor Viri:C. Mature factor VIII:C is composed of 
two chains bridged by a calcium ion; an amino-terminal 
heavy chain of 740 amino acids, and a Ccurboxy- terminal 
light chain of 684 amino acids. The primary trans- 

10 lation product of factor VIII :C is a single chain 
in which the heavy chain of mature factor VTII:C is 
separated from the light chain by a "maturation 
polypeptide" of 908 amino acids. The excision of 
this maturation polypeptide is initiated by pro- 

15 teolytic cleavage of the primary translation product 
by an unknown or yet unidentified protease at the 
Arg 1648 - Glu 1649 peptide bond. The initial nick 
event begins a series of successive proteolytic 
cleavages which shorten the nascent heavy g!>^;^in from 

20 its carboxy terminus . Eventually the mature heavy 
chain of 740 amino acids results and in combination 
with the light chain of 684 amino acids, comprises 
mature factor VIII rC [see L.hO« Andersson et al. 
"Isolation and Characterization of Human Factor VIII: 

25 Molecular Forms In Commercial Factor VIII Concen- 
trate, Cryoprecipitate, and Plasma," PNAS(irSA) , 83, 
pp. 2979-83 (1986)]. This complex is then activated 
by thrombin by cleavage at the Arg 1689-Ser 1690 
bond [D. Eaton et al., BiochMiistry , 25, pp. 505-12 

30 (1986)]. 

Ha em o philia A is a sex-linked hemorrhagic 
disease which is caused by a deficiency, either in 
amount or in biological activity, of factor VIII :C. 
The syn^jtoms of acutely bleeding haemophilia patients 
35 are treated with factor VIII traditionally purified 
from no rmal sera. Various methods of purification 
have been described in the literature [see, Zimmerman 
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et alw united States patent 4,361,509; Saundrey 
et al. United States patent 4,578,218; E.G.D. 
Tuddenhem et al», "The Properties of Factor VIII 
Coagulant Activity Prepared By Immxinoadsorbent 
5 Chromatography, Joiimal of Laboratory Clinical 
Medicine , 93, pp. 40-53 (1979); D. E- Austen, 
"The Chromatographic Separation of Factor VIII on 
Aminohexyl Sepharose," British Journal of Hematology , 
43, pp. 669-74 (1979); M. Weinstein et al., "Analysis 

10 of Factor VIII Coagiaant Antigen In Normal, Thrombin- 
treated, and Hemophilic Plasma," PNAS (USA), 78, 
pp. 5137-41 (1981); P. J. Fay et al., "Purification 
And Characterization Of A Highly Purified Human Factor 
VIII Consisting Of A Single Type Of Polypeptide 

15 Chain," PNAS (USA), 79, pp. 7200-04 (1982); C. A. 
Fulcher and T. S. Zimmerman, "Characterization Of 
The Human Factor VIII Procoagulant Protein With A 
Heterologous Precipitating Antibody," PNAS (USA), 
79, pp. 1648-52 (1982); F. Rotblat et al., Thromb. 

20 Haemostasis . 50, p. 108 (1983); C. A. Fulcher et al.. 
Blood , 61, pp. 807-11 (1983)]. 

However, purification has proven to be 
difficult because of the relatively low concentration 
of factor VIII :C in serum, its tight association 

25 with the larger factor VIIIR:Ag and its sensitivity 
to degradation by serum proteases. Factor VlllrC 
when purified from plasma thus contains a heterogen- 
eous mixture of heavy rghaiTia ranging in length from 
1648 amino acids down to 740 amino acids which result 

30 from these numerous proteolytic events [Andersson 
et al., supra , p. 2983]. The heterogenous mixture 
of chains observed in plasma-purified factor VIII :C, 
has made recovery of a substantially pure mature 
factor VIII :C almost in^jossible. Furthermore, tradi- 

35 tional treatment of basophilia with factor VIII 

purified from plasma has serious drawbacks. Specifi- 
cally, it can lead to the unintended transfer of the 
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causative agents of hepatitis or the virus associated 
with Acquired Immune Deficiency Syndrome* 

In view of its importance in the treatment 
of haemophilia, numerous attempts have been made to 
5 produce large quantities of factor VIII:C using 

recombinant DNA technology [See, for exas^le. Genetics 
Institute, PCT application W085/01961; Genentech 
European Patent application 160,457; Chiron European 
Patent application 150^735; J. J. Toole et al*, 

10 "Molecular Cloning Of a cDHA Encoding Human Antihae- 
mophilic Factor" ITature ^ 312, pp. 342-47 (1984); and 
W. I. Wood et al.. Nature , 312, pp» 330-37 (1984)] . 
However, such attempts have proven to be less 
successful than had been hoped. This is partially 

15 due to the fact that the recombinantly produced 2332 
amino acid factor VIII rC chain is subject to proteo- 
lytic cleavage at many positions. It is also due to 
difficulties din producing recombinant factor VIII tC 
in sufficiently high yields. 

20 SUMMARY OF THE INVENTION 

The present invention solves the problems 
referred to above by providing DNA sequences which 
encode modified factor VIII :C and modified factor 
VIII:C-like polypeptides. These DNA sequences code 

25 for polypeptides which are produced in approximately 
twenty-times higher yields than previous recombinantly 
produced factor VII1:C and are more easily purified 
into biochaaiically pure mature factor VIII :C. 

According the present invention, DNA 

30 sequ^ices coding for modified factor VIII :C are pro- 
duced and expressed in high yields. As will be appar- 
ent from the disclosure and exaii5>les to follow, the 
modified factor VIII rC and modified factor VIII:C-like 
polypeptides of this invention are characterized by 

35 deletions removing a major part of the maturation 
polypeptide of factor VIII :C. The DNA sequences in 


wo 88/0083t 


PCT/LS87/01814 


our preferred embodiment: have a dele'tion of substan- 
tially all of the nucleotides coding for the matura- 
tion polypeptide. Our most preferred embodiment 
contains a deletion of all the DNA sequence coding 
5 for the maturation polypeptide. On expression of 
our DNA sequences, the heavy chain of mature factor 
VIII :C is linked directly to the light chain. Follow- 
ing a one-nick proteolytic event, the mature form of 
factor VIII :C is generated. 

10 Finally, the present invention provides 

various anti-haemophilic compositions containing 
modified factor VIII :C and modified factor VIII :C- 
like polypeptides produced by the ONA sequences 
of this invention, and various methods of using 

15 those compositions in haemophilia treatment- 
therapy of acute or prolonged bleeding in 
haemophilia A. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 depiqts a restriction map of the 
20 factor VIII :C cDNA. 

Figure 2 is a schematic depiction of the 
construction of the recombinant DNA molecule with 
the QD deletion. 

Figures 3A and 3B depict a schematic repre- 
25 sentation of the construction of the recombinant DNA 
molecule with the RE deletion. 

Figure 4 depicts a restriction endonuclease 
map of the RE deletion inserted into the mammalian 
cell expression vector pBG312 indicating the positions 
30 of the SV40 origin of replication/enhancer, the adeno- 
virus major late promoter, the factor VIII :C cDNA 
with the RjS deletion, the 3 ' untranslated region of 
the factor VIII :C mRNA, and the polyadenylation site. 
Figure 5 depicts the results of an SI 
35 analysis of Factor VIII :C mRNA isolated from trans- 
fected EMTIO cells. 
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Figure 6 depicts the results of a Southern 
analysis o£ plasmid DNA isolated from transfected 
BHno cells. 

Figure 7 depicts the published DNA and 
5 amino acid sequence of factor VIII rC (EPO application 
160,457) • 

DETAILED DESCRIPTION OF THE INVENTION 

In order that the indention herein described 
may be more fully understood, the following detailed 
10 description is set forth. 

In the descrip^on the following terms are 

employed: 

Nucleotide — A monomeric unit of DNA or BNA 
consisting of a sugar moiety (pentose), a phosphate, 

15 and a nitrogenotxs heterocyclic base. The base is 

linked to the sugar moiety via the glycosidic carbon 
(1' carbon of the pentose) and that combination of 
base and sugar is called a nucleoside. The base 
characterizes the nucleotide. The four DNA bases 

20 are adenine ("A"), guanine ("G"), cytosine ("C"), 
and thymine ("T"). The four RNA bases are A, G, C, 
and uracil ("U")* 

DNA Sequence — ^A linear array of nucleotides 
connected one to the other by phosphodiester bonds 

25 between the 3' and 5* carbons of adjacent pentoses. 

Codon~ A DNA sequence of three nucleotides 
(a triplet) which encodes through mRNA an amino acid, 
a translation start signal or a translation termina- 
tion sighal. For example, the nucleotide triplets 

30 TTA, TT6, CTT, CTC, CTA and CTG encode for the amino 
acid leucine ("Leu"), TAG, TAA and TGA are transla- 
tion stop signals and ATG is a translation start 
signal. 

Amino Acid~ A monomeric unit of a peptide, 
35 polypeptide or protein. The twenty amino acids are: 
phenylalanine ("Phe" or "F"), leucine ("Leu", "L"), 
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isoleucine ("He", "I"), methionine ("Met", "M"), 
valine ("Val", "V"), serine ("Ser", "S"), proline 
("Pro", "P"), threonine ("Thr", "T"), alanine 
("Ala", "A"), tyrosine ("Tyr", "Y"), histidine 
5 ("His", "H"), glutamine ("Gin", "Q"), asparagine 
("Asn:N"), lysine ("Lys:K"), aspartic acid ("Asp", 
"D"), glutamic acid ("Glu", "E"), cysteine ("Cys", 
"C"), tryptophane ("Trp", "W"), arginine ("Arg", 
"R") and glycine ("Gly", "G"). 
10 Reading Frame — The grouping of codons during 

the translation of mRNA into amino acid sequences. 
During translation the proper reading frame must.be 
maintained. For example, the DMA sequence 
GCTGGTTGTAAG may be e^^ressed in three reading frames 
15 or phases, each of which affords a different amino 
acid sequence: 

GCT GCT TGT AAG — Ala-Gly-Cys-Lys G 
CTG GTT GTA AG — Leu-Val-Val GC TGG 
TTG.IAA G~Trp-Leu-(STOP> 
20 Polypeptide — A linear array of amino acids 
coxmected one to the other by peptide bonds between 
the a -amino and carboxy groups of adjacent amino 
acids. 

Genome — The entire DNA of a cell or a virus • 
25 It includes inter alia the structural gene coding 
for the polypeptides of the substance, as well as 
operator, promoter and ribosome binding and interac- 
tion sequences, including sequences such as the Shine- 
Dalgamo sequences. 
30 Gene— A DNA sequence which encodes through 
its template or messenger RKA ("mBNA") a sequence of 
amino acids characteristic of a specific pol3^eptide« 

Transcription— The process of producing 
mRNA from a g^e or DNA sequence. 
35 Translation — The process of producing a 

polypeptide from mRNA. 
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Expression — Xhe process undergone by a 
gene or ONA sequence to produce a polypeptide. It 
is a combination of transcription and translation. 

Plasmid^ -'A noncbromosomal double-stranded 
5 DNA sequence comprising an intact "replicon" such 
that the plasmid is replicated in a host cell. When 
the plasmid is placed within a unicellular organism, 
the characteristics of that organism may be changed 
or transformed as a result of the DNA of the plasmid. 

10 For example, a plasmid carrying the gene for tetra- 
cycline resistance (TET^> transforms a cell previously 
sensitive to tetracycline into one which is resistant 
to it. A cell transformed by a plasmid is called a 
"transformant" . 

15 Phage or Bacteriophacre — Bacterial virus ^ 

many of which consist of DNA sequences toc^sidated 
in a protein envelope or coat ("capsid"). 

Cloning Vehicle — A plasmid, phage DNA, 
cosmid or other DNA sequence which is able to repli- 

20 cate in a host cell, characterized by one or a small 
number of endonuclease recognition sites at which 
such DNA sequences may be cut in a deteimdnable 
fashion without attendant loss of an essential bio- 
logical function of the DNA, e.g., replication, pro- 

25 duction of coat proteins or loss of promoter or 

binding sites, and which contains a marker suitable 
for use in the identification of transformed cells, 
e.g., tetracycline resistance or anpicillin resist* 
ance. A cloning vehicle is often called a vector. 

30 Cloning — ^The process of obtaining a popu- 

lation of organisms or DNA sequences derived from 
one such organism or sequence by asexual reproduction. 

Recombinant DNA Molecule or Hybrid DNA — A 
molecule consisting of segments of DNA from different 

35 genomes wh i ch have been joined end-to-end outside of 
living cells and able to be maintained in living 
cells . 


wo 88/00831 


PCr/US87/0l8l4 


-9- 

Expression Control Sequence^ -^A sequence of 
nucleotides that controls and regulates expression 
of genes when operatively linked to those genes. 
They include the lac system, the ^-lactamase system, 
5 the trp system, the tac and trc systems, the major 
operator and promoter regions of phage X, the control 
region of fd coat protein, the early and late pro- 
moters of SV40, promoters derived from polyoma virus 
and adenovirus, metallothion i ne promoters, the pro* 

10 moter for 3-phosphoglycerate kinase or other gly- 
colytic enzymes, the promoters of acid phosphatase, 
e.g., Pho5, the promoters of the yeast a-mating fac- 
tors, and other sequences known to control the ex- 
pression of genes of prokaryotic or eukaryotic micro- 

15 bial cells and their viruses or combinations thereof • 
Factor VIII ;C ~A polypeptide having the 
amino acid sequence of Figure 7, and upon maturation 
and activation, being capable of functioning as 
co-factor for the factor IXa-dependent maturation of 

20 factor X in the blood coagulation cascade. As used 

in this application, factor VIII :C includes the glyco- 
proteins also known as factor VIII procoagulant 
activity protein, factor Vlll-clotting activity, 
antihemophilic globulin (AHG), antihemophilic factor 

25 (AHF), and antihemophilic factor A [see W. J. Williams 
et al.. Hematology , pp. 1056, 1074 and 1081}. 

Maturation Polypeptide --The maturation 
polypeptide of factor VIII :C is made up of the 
908 amino acids from amino acid Ser (741) to amino 

30 acid Arg (1648) (see Figure 7). Maturation of 

factor VIII :C is initiated with a cleavage between 
amino acids 1648 and 1649 (which produces a 
C-terminal light chain) followed by a series of 
nicks which produce the mature N-terminal heavy 

35 chain. 
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Mature Factor VIII :C --As xxsed ia this 
application, mature factor VIII :C is composed of 
an terminal heavy chain (Ala 1- Arg 740 ) linked 
to a C- terminal light chain (Glu 1649-Tyr 2332) 
5 through an alkaline metal bridge, such as calcium 
(Figure 7). 

Modified Factor VIII ;C ~ As used in this 
application, "modified factor VIII :C" refers to poly- 
peptides characterized by a deletion of a major por- 

10 tion of the maturation polypeptide of factor VIII :C. 
For example, where the entire maturation polypeptide 
has been deleted, "modified factor VIII:C" includes 
proteins that coiQ>rise the y-terminal mature heavy 
chain and the C-terminal mature li^t chain of factor 

15 VIII :C linked together as a single chain* 

Modified Factor VIII:C-Like Polypeptide ~ 
As used in this application, "modified factor VIII rC- 
like polypeptide" includes proteins having the bio- 
logical activity of modified factor VHI:C* It 

20 also includes proteins having an amino terminal 

methionine, e.g., f -Met- f actor VHI:C, and proteins 
that are characterized by other amino acid deletions, 
additions or substitutions so long as those proteins 
substantially retain the biological activity of 

25 modified factor villsC. 

"Modified factor VIII:C-like polypeptides" 
within the above-definition also includes 'natural 
allelic variations that may exist and occur from 
individual to individual. Furthermore, it includes 

30 modified factor VIII:G-like polypeptides whose 
degree and location of glycosylation, or other 
post- translation modifications, may vary depending 
on the cellular environment of the producing host 
or tissue. 
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The present invention relates to processes 
for the production of modified factor VIII :C and 
modified factor VIII:C-like polypeptides. More 
pctrticularly, it provides DNA sequences whichr.permit 
5 the production of modified factor VIir:C and: modi- 
fied factor VIII:C-like polypeptides in. hii^ jsielda,. 
in appropriate hosts. Polypeptides p nji l uce felfcg. the - . 
DNA sequences of this invention are- uses&rT ftr^th)^. 
clinical treatment of haemophilia A* • 

10 As compared to factor vriTrC;. tix^modified 

factor VIII :C produced by the DNA Begaessces o£ this 
invention lack a major portion of thei maturation 
polypeptide of factor VIII :C. The DHA: sequences 
of the present invention surprisingly express 

15 modified factor VIII :C in much higher yields than 
DNA sequences coding for factor VIII :C itself. 

While not wishing to be botmd by theory, 
we believe that the DNA sequences of the present 
invention produce modified factor VIII :C in high 

20 yields, because of the absence of most or all of 
the maturation polypeptide. For example, the 
mRNA for the modified gene may be translated more 
efficiently, because the HNA coding for the long 
maturation polypeptide does not have to be trans- 

25 lated. In addition, while factor VIII :C has many 
proteolytic targets which may be attacked while the 
polypeptide is in the cell, the modified factor 
VIII :C is less subject to such proteolytic attack 
because it lacks the proteolytic targets within 

30 the maturation polypeptide. Furthermore, when the 
maturation polypeptide is absent, 19 of the 25 
N-linked glycosylation sites of native factor VIII :C 
are deleted, leaving only six N-liked glycosylation 
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sites on the modified polypeptide (three on the heavy 
chain and three on the light chain). Apparently, 
because there are fewer sites to be glycosylated, 
production and purification of the modified factor 
5 VIII ;C is simplified. 

In the processes of this invention, we 
modify the ONA sequence encoding factor VIII :C to 
delete from it a major portion of the OUA sequence 
encoding the maturation polypeptide. Having pre- 

10 pared a DNA sequence carrying the desired deletion 
we en^^loy it in a varielgf of expression vectors and 
hosts to produce modified factor VIII :C encoded by 
it. For exassapler any of a wide variety of expression 
vectors are useful in expressing the modified factor 

15 VlllrC coding sequences of this invention. It also 
should be understood that DNA sequences encoding a 
modified factor VlllrC-like polypeptide can be 
similarly produced in accordance with this invention. 
Useful expression vectors include, for 

20 example, vectors consisting of segments of 

chromosomal, non-chromosomal and synthetic DMA 
sequences, such as variooxs known derivatives of 
SV40, known bacterial plasmids^ e.g., plasmids from 
E.coli including col El, pCRiV pBR322, pMB9 and 

25 their derivatives, wider host range plasmids, e.g., 
BP4, phage DNAs, e.g., the numerous derivatives of 
phage \, e.g., TM 989, and other ONA phages, e.g., 
1113 and Fileunenteous single stranded DNA phages, 
yeast plasmids such as the 2p plasmid or derivatives 

30 thereof, and vectors derived from combinations of 

plasmids and phage DMAs, such as plasmids which have 
been modified to employ phage DNA or other expression 
control sequences. In the preferred embodiments of 
this invention, we employ pBG3 12, a pBR327-rerated 

35 vector [R. Gate et al.. Cell, 45, pp. 685-98 (1986)1. 

In addition, any of a wide variety of 
egression control sequences — sequences that con- 


wo 88/00831 


PCr/L 587/01814 


trol the expression of a DMA sequence when operatively 
linked to it — may be used in these vectors to 
express the DMA sequence of this invention. Such 
useful expression control sequences, include, for 
5 example, the early and late promoters of SV40, the 
lac system, the tz£ system, the TAG or TRC system, 
the major operator and promoter regions of phage X, 
the control regions of fd coat protein, the promoter 
for 3-phosphoglycerate kinase or other glycolytic 

10 enzymes, the promoters of acid phosphatase, e.g., 
Fho5, the promoters of the yeast a -mating factors, 
and other sequences known to control the expression 
of genes of prokaryotic or eukaryotic cells or their 
viruses, and various combinations thereof. In the 

15 preferred embodiment of this invention, we employ 
adenovirus-2 major late promoter expression control 
sequences . 

A wide variety of host cells are also useful 
in producing the modified factor VIII :C of this inven- 

20 tion. These hosts may include well known eukaryotic 
and prokaryotic hosts, such as strains of E.coli , 
Pseudomonas , Bacillus, Streptomyces , fnngi such as 
yeasts, and animal cells, such as CHO cells, African 
green monkey cells, such as COSl, C0S7, BSCl, BSC40, 

25 and BtCTlO, and human cells and plant cells in tissue 
culture. In the preferred embodiments of this inven- 
tion, we prefer BBCTIO African green monkey cells. 

It should of course be understood that not 
all vectors and expression control sequences will 

30 function equally well to express the modified DNA 

sequences of this invention and to produce otir modi- 
fied factor VIII :C. Neither will all hosts function 
equally well with the same expression system. How- 
ever, one of skill in the art may make a selection 

35 among these vectors, expression control sequences 
and hosts without undue experimentation without 
departing from the scope of this invention. For 
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ezample, in selecting a vector, the host must: be 
considered because the vector must replicate in it. 
The vector's copy number, the ability to control 
that copy number, and the expression of any other 
5 proteins encoded by the vector, such as antibiotic 
markers, should also be considered. 

In selecting an egression control sequence, 
a variety of factors should also be considered. These 
include, for example, the relative strength of the 

10 system, its controllability, and its compatibility 
with the DNA sequence encoding the modified factor 
VIII:C of this invention, particularly as regards 
potential secondary structures. Hosts should be 
selected by consideration of their coii5)atibility with 

15 the chosen vector, the toxicitgr of our modified factor 
VIII :C to them, their secret:ion characteristics, their 
ability to fold proteins correctly, their fermenta- 
t:ion requirements, and the ease of the purification 
of our modified factor VlllrC from them and safety. 

20 Within these paramieters one of skill in 

the art may select vanous vector/expression control 
system/host combinatixons that will produce useful 
amoimts of our modified factor VIII :C on fermenta- 
tion. For example, in one preferred embodiment of 

25 this invention, we use an pBG312 vector, with an 
adenovijnzs 2 major late promoter expression systiem 
in EMTIO African green monkey cells. 

The modified factor VtII:C and modified 
factor Vlll-like polypeptides produced according 

30 to this invention may be purified by a variety of 
conventional steps and strategies. Useful purifi- 
cation steps include those used to purify natural 
and recombinant factor VllliC [see, for example, 
L.-O. Andersscm et al., PNAS (USA), 83, pp. 2979-r83 

35 (1986)1. 

After purification the modified factor 
VIII rC and modified factor VIII:C-like polypeptides 
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of this invention are useful in composition and 
methods for treatment of haemophilia A and in a 
variety of agents useful in treating uncontrolled 
bleeding. 

5 While the modified factor VIII :C and 

modified factor VIII:C-like polypeptides of this 
invention may be administered in such compositions 
and methods in the form in which they are produced, 
as single chain polypeptides, it should also be xmder- 

10 stood that it is within the scope of this invention 
to administer the modified factor VIII :C after sub- 
jecting it to proteolytic cleavage- For exanple, 
modified factor VIII :C can be cleaved in vitro , into 
the heavy chain and light chain of mature factor 

15 VIII :C and linked with a calcuim or other alkaline 
metal bridge, before, during or after purification. 

The modified factor VIII :C and modified 
factor VIir:C-like polypeptides of this invention 
may be formulated using known methods to prepare 

20 pharmaceutically useful compositions* Such composi* 
tions also will preferably include conventional 
pharmaceutically acceptable carriers and may include 
other medicinal agents, carriers, adjuvants, 
ezcipients, etc., e.g., human serum albtimin or plasma 

25 preparations. See, e.g.. Remington's Pharmaceutical 
Sciences (E. W. Martin). The resulting formulations 
will contain ah amount of modified factor VIII :C 
effective in the recipient to treat uncontrolled 
bleeding. Administration of these polypeptides, or. 

30 pharmaceutically acceptable derivatives thereof, may 
be via any of the conventional accepted modes of 
administration of factor VIII. These include paren- 
teral, subcutaneous, or intravenous admdLnistration. 

The compositions of this invention used in 

35 the therapy of haemophilia may also be in a variety 
of forms. The preferred form depends on the intended 
mode of administration and therapeutic application. 
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The dosage and dose rate will depend on a variety of 
factors for exaii^le, whether the treatment is given 
to an acutely bleeding patient or as a prophylactic 
treatment • However, the factor VlllrC level shoiild 
5 be high enough to prevent hemorrhage and promote 
epithelialization [see discussion in Williapis, 
Hematology ^ pp. 1335-43]. 

In order that this inv^tion may be better 
understood, the following exaniple is set forth. 
10 This example is for purposes of illustration only 
, and is not to be construed as limiting the scope of 
the invention. 

EXftMPLE 

We have constructed cDNA sequences which 
15 encode modified factor VIII :C molecules having a 

deletion of a major part of or all of the maturation 
polypeptide. To test the limits of our invention, 
we also constructed a cDHA segu^ice which encodes a 
polypeptide having a deletion of more than just the 
20 maturation polypeptide of factor VIII :C. 

A. ASSEMBLY OF THE FDIiL-I£NGTH 
FACTOR VIII rC cDNA ^ 

Referring now to Figure 1, we have presented 

therein a restriction enzyme m^ of the- factor VIII rC 

J" 

25 cONA based upon the published segu^ce [W. I . Wood 
et al.. Nature . 312, pp. 330-37 (1984); (Figure 7)1. 
The bar represents the coding sequence. Below the 
restriction enzyme map we have depicited the amino- 
terminal heavy chain of mature factor VIII:C attached 

30 by a calcium bridge to the carboxy-terminal light 
chain of mature factor VIII rC. Below the protein 
model on a bar congruent to the restriction enzyme . 
map we have indicated the oligonucleotide probes 
(indicated with asterisks) which we used to screen 

35 human placenta, liver, and kidney cDNA libraries. 
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These libraries were made using oligo (dl) as first- 
strand primer and \gtlO as vector. 

On this second bar are also located the 
oligonucleotide primers (left-arrows) which we used 
5 to initiate first-strand cDNA synthesis using human 
kidney mRNA as tacnplate. We made these singles- 
stranded cDNA sequences double-stranded by the 
technique of Gubler and Hoffman [U. Gubler, and 
B. J. Hofflaan, G6ne , 25, pp. 263-69 (1983)]. We 

10 cloned them at the dC- tailed EcoR V site in pBR322. 
We then screened this plasmid-based kidney cDNA 
library with oligonucleotide probes located on 
the bar 5' to the oligonucleotide primers. 

Below the primer/probe bar in Figure 1, we 

15 have displayed a collection of partial-length factor 
VIII :C cDNA and genomic subclones, which we isolated 
from these libraries. Together these encode the 
full-length CDMA gene. More information about these 
clones is presented below, in Table 1. 
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TABLE 1 


COMPENDIUM OF FACTOR VlllrC 
GENOMIC AND CDHft CLONES 

isolated from a genomic library constructed with 
5 cosmid pTCF [Grosveld, F.G. et. al., Nticleic 

Acids Research, 10, pp. 6715-6732 (1982)1 

subclone length tissue- 

pnCl9.2874 2874 bp 48,XXXX 

human lynphoblast 

10 isolated from oligo(dT) -primed XcrtlO cDNA libraries 

clone length probe hybridization 

1.7977 (placenta) 1728 79+, 77+ 

2.73 (liver) -700 73+ 

4.73 (kidney) -220 73+ 

15 isolated from a 85, 86-primed Gubler-Hoffioan 
kidney cDNA library 

probe hybridization 

82+, 79-, 77- 

82+ 79— 77— 

20 3-7573 -2700 74-^ 75+^ 73+ 

74-, 75+, 73+ 

74-, 75+, 73+ 

. 82-, 79+, 77+, 83+ 

82-, 79+, 77+, 83+ 

25 12.797783 >1263 82-, 79+, 77+, 83+ 

82-, 79+, 77+, 83+ 

isolated from a 75, 77-primed Gubler-Hdffinan 
kidney cDNA library 

clone length probe Iqrbridization 

30 7.7475 -2700 74+, 75+ 


clone 

length 

1.82 

-1200 

2.82 

-1200 

3-7573 

-2700 

4.7573 

-2700 

6.7573 

-2700 

10.797783 

>1263 

11.797783 

>1263 

12.797783 

>1263 

13.797783 

>1263 
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Prior to etssembling the full-length cDMA 
gene, we constructed two intermediate plasmids. 
This was necessary because of the excessive length 
of the factor VIII :C cDNA. For our first preliminary 
5 construction we isolated a fragment from clone 7.7475 
extending from the PstI site at 5163 to the PstI 
site at 5755. We inserted this fragment into clone 
4.7573 at the Pst I site at 5755 thereby extending 
clone 4.7573. This PstI site is shared by the inserts 

10 of both clone 7.7475 and clone 4.7573. By extending 
clone 4.7573 in this manner, we provided a unique 
Ndel site at 5522 in the insert of this derivative 
of clone 4.7573. We needed to create this Nde site 
because we needed a unique site at which to extend 

15 the length of this insert at its 5' end. 

As a second preliminary construction we 
introduced a polynucleotide linker in clone 2.82 at 
a location immediately 5* to the translation start 
codon of the signal sequence of factor VIII :C. The 

20 insert of clone 2.82 is at the EcoR V site of pBR322 
a nd its orientation is opposite to that of tetra- 
cycline resistance. The 5' endpoint of the insert in 
clone 2.82 is at -133 in the 5* untranslated leader 
sequence. We cleaved clone 2.82 at the Sai l site in 

25 tetracycline resistance and at the Sad site in the 
sequence encoding the signal peptide in the insert 
of clone 2.82 and inserted the synthetic duplex 

BH 
BSGNSS 

30 ACM R C APISAS 

NIAPCT 
221211 

///// 

GTCGACTCGCGACCATGGATGCAAATAGAGCTC 

35 1 + + + 33 

(aGCTdAGCGCTGGTACCTACGTTTATCTCGAG 

MetGlnl leGluLeu 

This ligation resulted in the introduction of a 
Sal l-Nrul-Ncol polylinker immediately 5' to the start 


H 



SAI 

N 

N 

ACN 

R 

C 

LCC 

U 

0 

112 

1 

1 
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codon which initiates translation of the signal 
sequence of factor VIII :C. These three restriction 
enzymes do not cleave the fall-length factor VlilrC 
cDNA gene* 

5 With these two intermediate constructions 

available, we cissembled the full-length factor VIII :C 
CDHA in a six-fragment ligation reaction (bottom 
Figure 1) . It was necessary to create the full length 
DNA in this manner because we never isolated the 

10 full DNA in one single clone. We isolated fragment 
1 from the above-described derivative of clone 2.82. 
Fragment 1 extended from Sai l in the polylinker to 
Ava l at 731* We isolated Fragment 2 from the insert 
in the \gtlO recombinant 1*7977. Fragment 2 extended 

15 from Ava l at 731 to EcoR I at 2289. Fragment 3 derived 
from the subclone pUC19.2874 of a genomic cosmid. 
recombinant; it extended from EcoR I at 2289 to BamH I 
at 4743. Fragment 4 was isolated from clone 7.74575, 
starting from the BamF I site at 4743 and extending 

20 to the Ndel site at 5522. We isolated fragment 5 

from the above-described derivative of clone 4.7573. 
Fragment 5 extended from Ndel at 5522 to Nco l at 
7991. Fragment 6 is an assembly vector containing 
an E.coli replication origin and selectable marker 

25 for ampicillin resistance. We isolated Fragment 6 
from pAT.SV2.tPA, a gift from Richard Fisher. This 
is a plasmid in which the transcription of the tPA 
gene is under the control of the SV40 early promoter. 
We digested pAT.SV2.tPA with Sail which cleaves within 

30 the tetracycline resistance marker , and with Nco l 
which cleaves wit2iin the SV40 early region. 

Of the 96 recombinants we analyzed^ 32 
contained all five factor VIII :C restriction frag- 
ments. We determined the DNA sequ^ce of one of 

35 these clones, and we identified two changes with 

respect to the published sequence. One is a CTG to 
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CTA change at Leu 242 and the other is a TTC to CTC 
change at amino acid residue 1880 (Phe to Leu) 
(compare Figure ?)• 

B. INSERTION OF THE FULL-LENGTH cDNA 
5 INTO A MAMMALIAN CELL EXPRESSION 
VECTOR 

We excised the full-length factor VIII :C 
cDNA gene from the assembly vector by digestion with 
Ncol. We then treated the resultant Ncol restriction 

10 fragment with nuclease SI to create a blunt end- We 
ligated this fragment to Sma l-digested pBG312. pBG312 
is an qpl Tnai cell expresion vector whose construction 
has been described elsewhere [R. Gate et al.. Cell , 
45, pp. 685-98 (1986)]. The sequence of BG312, from 

15 EcoR I to BamTTT has (clockwise): a SV40 replication 
origin; an adenovirus-2 major late promoter and com- 
plete tripartite leader [S. Zain et al.. Cell, 16, 
pp. 851-61 (1979)]; a hybrid splice signal consisting 
of an adenovirus-5 splice donor and an immunoglobulin 

20 variable region gene splice acceptor £R. J. Kaufman, 
and P. A. Sharp, J. Mol. Biol. , 159, pp. 601-21 
(1982)]; a polylinker containing sites, for Hind lll, 
Xho l, EcoRI, Smal, Nde l, Sst I, and Bglll; the SV40 
small t antigen intron flanked by its splice donor 

25 and acceptor; and the SV40 early polyadenylation 
site. 

We verified the DNA sequence across the 
junction between the polylinker of pBG312 and the 
cDNA gene encoding factor VIII :C including the signal 

30 sequence for two independent clones: 8.1 and 8.2. 
Clone 8.1 differs from 8.2 in the 3* untranslated 
region; T 7806 is fused to the Sma l site of pBG312 
in clone 8.1 instead of the C of the Nco l site at 
7990 in clone 8.2. In addition, we isolated another 

35 clone, in which the fusion of the cDNA gene encoding 
factor VIII :C to pBG312 had occured within the 
sequence encoding the signal peptide of factor 


wo 88/00831 


PCT/US87/01814 


-22- 

VIII :C. This clone, which we named signal-minxis, 
provided a negative control for our transient 
expression assays, described below. 

C. CONSTRUCTION OF GLN 744 - ASP 1563 
S (ABBREVIATED QD) DELETION 

In this section we demonstrate how we created 
the QD deletion which removes a portion of DNA sequence 
coding on expression for the maturation polypeptide 
(amino acids 741-1648). The QD deletion retains 

10 approximately 90 amino acids of the maturation poly-* 
peptide (four amino acides at the N-terminal end of 
the maturation polypeptide and 86 amino acids at its 
carboy terminal end). 

Referring now to Figure 2, we depict therein 

15 the construction of the QD deletion. We partially 
digested one aliquot of the expression plasmid for 
the full-length factor VIII :C gene with EcoR I> This 
endonuclease cleaves between the codons for Gin 744 
and Asn 745. We removed the 5'AATT overhang with 

20 nuclease Sl, and then subjected the plasmid to com- 
plete digestion with Pyul within the atnpicillin 
resistance gene. We partially digested another 
aliquot with BamH T , which cleaves between the codons 
for Leu 1562 and Asp 1563 (see Figure 7). We filled 

25 out the 5 'CATC overhang with the Klenow fragment, 

and again digested the plasmid with Pvul within amp . 
We then combined the two mixtures of fragments and 
ligated them with T4 DNA ligase* A B^^tT n site between 
the codons for Gin 744 and Asp 1563 was created in 

30 this fusion. 

The modified polypeptide produced on expres- 
sion as a result of the QD deletion lacks 818 amino 
acids from within the 908 ami no-acid maturation poly- 
peptide, leaving 4 amino acids Conterminal to the 

35 carboxy terminus of the mature heavy chain, Arg 740, 
and leaving 86 amino acids N- terminal to the amino 
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temiinus of the light chain, Glu 1649 (Figxire 7). 
The 908 amino-acid maturation polypeptide is thus, 
replaced by a 90 amino-acid maturation polypeptide, 
with the protease substates for both initial 
5 maturation of the primary translation polypeptide 
and subsequent maturation of the heavy chain 
remaining intact. 

D. CONSTRUCTION OF THE AUG 740 - GLU 1649 
(ABBREVIATED RE) DELETION 

10 We demonstrate in this section how we 

created the RE deletion, which removes the entire 
DMA sequence coding for the maturation polypeptide- 

Referring now to Figures 3 A and B, we show 
how we obtained this RE deletion fusion in two steps. 

15 In the first step we ligated four fragments which 
restilted in an intermediate plasmid. These four 
fragments were: 

(1) the 462 bp fragment, obtained by 
digesting the e^qpression plasmid for the full-length 

20 gene with Hindlll between the codons for Arg 740 and 
Ser 741/ removing the 5» AGCT with nuclease Sl, and 
subsequently digesting with Kpn l which cleaves 
uniquely between the codons for Tyr 586 and Leu 587* 

(2) the synthetic oligonucleotide duplex 

25 fragment 

5'pGAA ATA ACT CCT ACT ACT CTT CAG TCA 

CTT TAT TGA GCA TGh TGA GAA CTC ACT CTA 5* 
Glu lie Thr Arg Thr Thr Leu Gin Ser Asp 
1649 1657 
30 (3) the 135 bp fragment obtained by digest- 

ing the expression plasmid for the full-length gene 
first with Sau3 A; we isolated the 411 bp fragment 
which resulted from SauS A digestion between the codons 
for Ser 1657 and Asp 1658 and between the codons for 
35 Glu 1794 and Asp 1795. Then, we digested the 411 bp 
fragment with PstI which cleaves between the codons 
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for Ala 1702 and Val 1703, to obtain the 135 bp 
5' fragments 

(4) pUCia digested with Kten I and Pst l. 

We then isolated a fragment encoding the 
5 RE fusion from this intermediate plasmid. To do 

this, we digested the intermediate plasmid generated 
in the four-fragment ligation with Asp7 18 and Pstl. 
The fragment encoding the BE ftision was used to 
replace the corresponding fragment in the expression 

10 plasmid for the QD fusion. We ligated the resultant 
624 bp f jf-agment encoding the EE fusion to the mixture 
of fragments which we obtained by first cos^letely 
digesting the expression plasmid for the QP internal 
deletion at the unique Asp7 18 site, next dephos- 

15 phorylating the 5' CTAC overhang with calf intestinal 
phosphatase, and then partially digesting the plasmid 
with Pstl. 

Referring now to Figure 4, we depict therein 
a map of the RE deletion inserted into pBCai2. . In 

20 the modified polypeptide produced on expression the • 
908 amino-acid maturation polypeptide is entirely 
removed. The novel polypeptide produced by this 
recombinant molecule cell is secreted, and may be 
purified as a single chain, i.e.^ the heavy chain is 

25 linked directly to the li^t chain. Because the Arg 

1648 • du 1649 peptide bond which is normally cleaved 
during the initial nicking of the full-length primary 
translation product is preserved in this deletion, 
the primary translation product for this internal 

30 deletion is nicked by the same protease that initiates 
nicking of the full-length primary translation product,, 
thus producing directly the mature form of the heavy 
chain of factor VIII :C. Our Western blot analysis 
(data not shown) confirms that the RE modified factor 

35 VIII :C encodes a single chain molecule which is then 
processed into a 90K heavy chain and an 80K light 
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chain in the culture medium. The resultant light 
chain possesses the peptide from Glu 1649 to Arg 
1689 that hinds the two-chain complex to von 
Willebrand protein. For this reason, this recombinant 
5 product, when secreted from a mammalian cell, will 
bind to the von Willebrand protein present in cell 
culture fluid. Similarly, when injected, it will 
complex to and circulate with plasma von Willebrand 
protein. Upon thrombin cleavage at Arg 1689 - Ser 
10 1690, the two-chain mature factor VIII :C will be 
activated and will dissociate from von Willebrand 
protein and assemble into its ternary complex with 
factor IXa and factor X on a platelet surface. 

E. CONSTRUCTION OF THE ARG 740 - SER 1690 
15 (ABBREVIATED RS) DELETION 

In order to test the outer limits. of these 
deletions, we constructed a plasmid which codes for 
a polypeptide with a deletion of more than the 
maturation polypeptide alone (i.e., we deleted the 

20 DNA sequence which codes on egression for the forty- 
one amino acids at the N- terminal end of the light 
chain of mature factor VIII :C). 

We constructed this RS ftision with the 
two-step strategy described above for the RE fusion. 

25 Our first step was a three- fragment ligation resulting 
in an intermediate plasmid. The three fragments 
which we ligated were: 

(1) the 462 bp fragment, obtained by digesting 
the e3q)ression plasiaid for the full-length gene with 

30 Hindi 1 1 between the codons for Arg 740 and Ser 741, 

removing the 5» AC3CT with nuclease Si, and subsequently 
digesting with Kpn l which cleaves uniquely between 
the codons for Tyr 586 and Leu 587 . 

(2) the synthetic oligonucleotide duplex 
35 fragment: 
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5' pAGC TTT CAA AAG AAA ACA CGA CAC TAT TTS: ATT GCT GCA 
TCG AAA GTT TTC TTT TGT GCT GTG ATA AAA TAA CC^ 5' 
Ser Phe Gin Lys Lys Thr Arg His Tyr Fhe He Ala Ala 
1690 1702 
5 (3) pUClS digested with Kjpn l and Pstl. 

In this fusion, we recreated the Hind lll site hetween 
the codons for Arg 740 and Ser 741 (now Ser 1690)* 

We isolated a fragment encoding the RS 
fusion from this intermediate plasmid and used this 

10 fragment in our second step to replace the corre- 
sponding fragment in the expression plasmid for the 
OD fusion*. In this second step, we isolated a 501 bp 
fragment encoding the RS fusion. We digested the 
intermediate plasmid with Asp7 18 and Pstl and isolated 

15 the fragment encoding the RS fusion. We then used 
the strategy described above for the RE fusion to 
replace the related fragment in the expression plasmid 
for the QD fusion with the 501 bp fragment. 

In addition to removing the entire 

20 maturation pol3fpeptide, the RS deletion removes DKA 
coding for the Glu 1649 - Arg 1689 peptide, the puta- 
tive von Willebrand binding domain. For this reason 
this recombinant molecule will not attach to circu- 
lating von Willebrand protein when it is secreted 

25 from an animal cell into culture fluid or when it is 
injected into a recipi«it* 

F. IR ANSFE CIIQg OF AFRICAN GSEEH 
MONKEY KHMET CEUtS 

We trans fected BKECLO cells [R. D. Gerard 
30 and Gluzman, Mol. Cell> Biol> , 5, pp. 3231-40 

(1985)1 with the supercoiled expression plasmid. We 

used the DEAE^dextran technique [L. M. Sompayvac and 

J. Danna, PMAS r 78, pp. 7575-78 (1981)] and 
chloroguine [H. Luthman and G. Magnusspn, Nticleic 
35 Acids Research ^ 11, pp. 1295-1308 (1983)] to trans- 
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fect the cells. Trans fectants are known to replicate 
the input expression plasmid to high copy number 
because SV40 T antigen is inducibly supplied in trans 
by EMTIO cells and binds to the SV40 origin of repli- 
5 cation linked to the modified factor VIII :C gene in 
the expression plasmids* However this technique is 
inefficient because, typically, only several percent 
of the trans fected cells will actusaiy incorporate DNA. 
The transfectants will secrete modified 
10 factor VIII :C for up to 120 hours. For most experi- 
ments, the cmVinl ratio is approximately 5*5; that 
is, a confluent monolayer of BHTIO transfectants in 
a 100 mm Petri dish (55 cm^) is covered with 10 ml 
culture fluid. 

15 G. FACTOR VIII ;C ACTIVITY ASSAY 

We assayed the signal-minus, 8.1, QP, RE 
and RS expression constructs for factor VIII :C produc- 
tion after trans fection in duplicate into EMTIO cells. 
We used a 96-well plate adaptation of KabiVitrum*s 

20 Coatest® Factor VIII :C. One petri dish was used to 
prepare RNA for SI analysis and the other petri dish 
was used to prepare Hirt DNA used in our Southern 
analysis. After 120 hours of incubation we assayed 
the cell culture fluids for factor VIII :C activily. 

25 We expressed our results in terms of % plasma level, 
where plasma factor VIIi:C concentration is approxi- 
mately 200 ng/ml^ 

In repeated trans fections, both the signal- 
minus construct (negative control) and the RS deletion 

30 have shown no detectable factor VIII :C activity. 
This may be e:q>lained by the deletion of the von 
Willebrand protein binding domain in the RS deletion. 

In the 120 hour experiment analyzed below, 
cells transfected with the foxll-length gene produced 

35 approximately 5% of the activity observed with both 
the QD and the RE deletions. The activity observed 
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with the QD deletion was 1.46% plasma level and that 
for the BE deletion was 1*30% plasma level. 

Thus, we observed that BI£riO cells trans* 
fected with the QD and RE deletions produce at least 
5 20 times more factor VIII :C than cells transfected 
with the full-length gene. 

H. NUCLEASE SI ANALYSIS OF 
FACTOR VIII:C mRNA 

In order to deteanaine the levels of mRNA 

10 in each construction, we conducted a nuclease SI 
analysis . This assay assists in the determination 
of the reason for the increased level of expression 
in our QD and RE deletions. 

We isolated RNA from 100 mm Petri dish 

15 cultures of EMTIO cells 120 hours after transfection, 
using the unpublished method of W. Schleuning and 
J. Bertonis. Briefly, according to this method, we 
lysed BKnO cells with 3 ml of 50 mM IrisrECl (pH 
7.5) - 5 mM EDTA - 1% SDS containing 100 ^ig/ml 

20 proteinase K for 20 minutes at 37^C. We transferred 
the lysate to a 50 ml conical tube containing 3 ml 
of phenol and then mechanically sheared the DNA for 
15 seconds at high speed in a Polytron (Brinkmazm 
Instruments ) . We extracted the aqueous phase with 

25 ether and adjusted it to 0.25 NaCl. We precipitated 
the nucleic acid fraction at 4®C, by the addition of 
an equal volume of isopropanol, collected it by 
centrifugation and redissolved it in 3 ml of water. 
We selectively precipitated RNA overnight at 4**C, by 

30 adjusting the solution to 2.8 H LiCl. 

We determined the amount of modified factor 
VIII :C mRNA for each construction. We isolated probes 
for the SI analysis by digesting the QD expression 
plasmid with Esp l. We labelled the 5' ends of the 

35 Esp l fragments with [y- P]ATP and T4 polyuucleotide 

kinase^ and annealed 10 pg RNA to 5000 cpm of the 
32 

P-antisense strand of the 477 nucleotide Esp l frag- 
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ment isolated on a 5% strand separation gel [A. M. 
Maxam and W. Gilbert, Methods In Enzymology / 65, 
pp. 499-560 (1980)] • We incubated the RNA overnight 
at 48 ®C in 10 pi 80% deionized formamide - 400 mM 
5 NaCl - 40 mM PIPES (pH 6.4) - 1 mM EDTA. The hybrid 
molecaaes were then digested for 60 minutes at 37**c 
by adding 190 pi nuclease Si at a concentration of 
100 units/ml in 0.28 M NaCl - 50 mM NaOAc (pH 4.6) - 
4.5 nM ZnS04. We terminated the digestion by adding 

10 EDTA to 10 mM and extracting with phenol. We 

denatured the protected fragments and subjected them 
to electrophoresis on a 5% strand separation gel. 
We exposed the dried gel to Kodak XAR-5 X-ray film 
backed by a Lightning-Plus intensifying screen 

15 (Dupont) overnight at -70^*0. The 477 nucleotide 
Esp l fragment has one end within the hybrid intron 
spliced out from the 5' xintranslated region of the 
factor VIII :C mRMA [R. J. Kaufman and P. A. Sharp, 
J. Mol. Biol. , 159, pp. 601-21 (1981)] and the other 

20 end within the codon for Ala 62 (Figtire 4). 

We detected modified factor VIII :C mRNA by 
protecting a single-stranded 300 nucleotide DNA frag- 
ment from digestion. The experiment was repeated 
with 1 \xg RNA in order to verify that the single- 

25 stranded probe was in excess. 

The results of nuclease SI analysis of 
modified factor VIII :C mRNA for each construct are 
shown in Figure 5. Our results indicated that modi- 
fied factor VIII :C mRNA levels are the same for all 

30 three deletions and the full-length factor VIII :C 
gene. Figure 5A is the analysis for 10 \ig of ii^ut 
RNA, and Figure 5B is the analysis for 1 pg of input 

RNA Lane 1 in both figures contains as marker 500 

cpm of the labeled 477 nucleotide single-stranded 

35 DNA fragment used to protect modified factor VIII :C 
mRNA from Si digestion; that is, 10% of the input to 
each hybridization reaction. Lane 2 contained RNA 
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isolated from BMTIO cells trans fected with the signal- 
minus construct; lane 3, BICTIO cells transfected 
with the full-length factor 7III:C cDNA (construct 
8.1); lane 4: BMIIO Cells transfected with modified 
5 factor VIII :C cDNA (QD deletion); lane 5: EMTIO cells 
transfected with modified factor VIII :C CDMA (RE 
deletion); lane 6: BHTIO cells transfected with the 
cDSSA. from the RS deletion; lane 7: marker fragments 
obtained bj digesting pBR322 irith Hinf l and labeling 

10 their 3' ends with [a- dAIP and Klenow enzYme (a 

gift of Richard Tizard) . . Equal amounts of a protected 
fragment of the ejected length of 300 bases are 
evident in both figures for the 8.1, QO, RE, and RS 
constructs. A protected fragment of approximately 

15 220 bases in length for the signal-minus construct 
is evident in both figures, reflecting the absence 
of a portion of the DNA sequence encoding the signal 
peptide. 

A comparison of Figures 5A and 5B demon- 
20 strates that the iaput 477 probe is in molar excess 
during the hybridizations for each construct* 
Although the modified factor VlllrC activity levels 
are at least 20-fold higher for the QP and RE dele- 
tions compared to the RS and the full-length con- 
25 structs, the amoxuit of mRNA in all , fdiir" constructs 
is very nearly the same. Therefbre^ the reason for 
the increase in egression for the QD and. RE deletions 
is post- transcriptional in nature. 

I. SOUTHERN ANALYSIS OF FLASHID 
30 DNA ISOLATED FROH TRANSFECEED 
EMTIO CKTiriS 

We conducted this analysis to determine 
the DNA levels of newly-replicated modified factor 
VIII :C plasmids for our deletions, in coii5)arison 
35 with the full-length gene. Again, this assay assisted 
in our determination of the reason for the high yields 
of modified factor VlllrC in our QD and RE deletions. 
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In order to control for differences in DMA 
replication in BMTIO cells for the various constructs, 
we performed a Southern analysis of extrachromosomal 
DMA isolated from each transfection. We isolated DMA 
5 from 100 mm petri dish cultures of EMTIO cells 120 
hours after transfection according to the method of 
Hirt [B. Hirt, J. Mol. Biol. , 26, pp. 365-69 (1967)]. 
For each construction, we digested 0.5 A260 units 
with Dpn l to distinguish newly-replicated (Dpnl- 

10 resistant) DMA from input methylated bacterial DMA 
( Dpn I-sensitive) . We electrophoresed the DMA frag- 
ments on a 0.7% agarose gel, and blotted t hem to 
GeneScreen Plus to analyze the DMA. The filter was 
hybridized at 65"»C in 1 M NaCl - 50 mM Tris-HCl (pH 

15 7.5) - 0.1% sodium pyrophosphate - 0.2% polyvinyl- 
pyrrolidone - 0.2% Ficoll - 0.2% BSA - 1% SDS using 
10^ cpm/ml denatured probQ. We then washed the 
filter at 65 with the same buffer and exposed it 
overnight at -70«C to Kodak XAR-5 X-ray film backed 

20 by a Lic^tning-Plus intensi^ing screai (Dupont). 

The factor VIII :C probe was the 2924 bp Esp l fragment 
isolated from the RE ejcpression plasmid (see Figure 4) 
and ^^P-labeled to a specific activity of 10^ cpm/pg 
by the random hexadeo^fnucleotide primer method of 

25 Feinberg and Vogelstein [A. P. Feinberg and 

B. Vogelstein, Anal. Biochem. , 132, pp. 6-13 (1983)]. 

Our results, which are depicted in Figure 6, 
indicate that newly-replicated modified factor VIII :C 
plasmid DMA levels are the same for all three dele- 

30 tions and the full-length gene. Lane 1 contained the 
1 kb ladder obtained fron BRL and labeled with T4 
DMA polymerase according to the manufacturer's proto- 
col; lane 2: 1 ng supercoiled RE DMA; lane 3 :. 10 ng_ 
supercoiled RE DNA; lane 4: 10 ng RE DMA digested 

35 with Dpn l; lane 5: Dpnl digest of 0.5 A260 units 

Hirt fraction obtained from BMTIO cells transfected 
with the signal-minus construct; lane 6: transfected 
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with the full-length factor VllltC cDNA (construct . 
8.1); lane 7: trans fected with the QP deletion; lane 
8: transfected with the RE deletion: lane 9: trans- 
fected with the RS deletion. Figure 6 shows nearly 
5 ' equal amounts of the supercoiled form of each con- 
struct after digestion with Dgnl (lanes 5-9), thus 
excluding the possibilitjr that differences in 0N& 
replication enhance the expression of the QD and BE 
deletions. Lane 2 contains 10 molecules of the RE 
10 construct and lane 3 contains 10 molecules, suggest- 
ing that the copy number is approximately 10 in the 
^proximately 10^ cells successfully transfected. 

J. CX)NSTRUCTION OF ARG 740-ASP 1658 
(ABBREVIATED RD) DELETION - 

15 In this section, we demonstrate how we 

created the RD deletion which removes the DKA 
sequence coding on expression from Ser 741 to Ser 
1657. We constructed this RD deletion fusion in 
three steps. In the first step, we digested plasmid 

20 QD (Figure 2) with Sau3A between the codons for Ser 
1657 and Asp 1658 and betwe^ 61u 1794 and Asp 1795. 
This produced a 411 bp fragment. We also linearized 
plasmid tsa pML [L. Daiiey et al., J. Virol. 54 , 
pp. 739-49 (1985)1 at the unique. Bell site. We then 

25 ligated the 411 base pair fragment derived from 

plasmid QP with T4 DNA ligase (the .lig^e for this 
and the following examples) to the linearized tsa 
pML at the unique Bel l site to generate plasmid 
411. Bell, which contains the Bell site on the Asp 

30 1658 side of the 411 bp insert (i.e., 5* to the 

sequence encoding Asp 1658). Plasmid 411 . Bel l may 
be linearized iiniquely with Bell , resulting in a 5* 
GATC overhang which consists of the GAT codon for 
Asp 1658 and the first base of the CAA codon for Gin 

35 1659. 
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We also digested plasmid QD with Hind i 1 1 
to cleave the plasmid between Arg 740 and Ser 741 
and within the codon for Glu 321 to generate a 1258 
bp fragment. We then removed the 5' AGCT overhang 
5 with nung bean nuclease and ligated it to the Bcll - 
linearized 411 . Bell fragment which had previously 
been rendered flush by treatment with Klenow enzyme 
and all four deoxynucleoside triphosphates. This 
resulted in plasmid HD.411, which contains an Asp7 18 

10 site 5' to the ftision site within the 1258 bp Hind i 1 1 
fragment. RD.411 contains a PstI site 3' to the 
fusion site within the 411 bp Sau3A fragment. 

Subsequently, we digested plasmid RE 
(Figure 3B) with Asp7 18 to cleave within the codon 

15 for Trp 585. 

We then dephosphorylated the 5' GTAC over- 
hang with calf intestinal phosphatase and then par- 
tially digested with Pstl. This partial digestion 
cleaved the linearized RE plasmid between the codo&s 

20 for Ala 1702 and Val 1703, thus removing a 628 bp 
fragment spanning the RE fusion. 

We then cleaved plasmid RD.411 with Asp7 18 
and Pst l to generate a 601 bp fragment spanning the 
RD fusion. We then ligated this fragment to the 

25 Asp718-cleaved, Pstl -partially cleaved RE plasmid 
DNA to generate plasmid RO. As demonstrated below, 
plasmid RD directed the expression of a factor VIII 
polypeptide with a fusion between Arg 740 and Asp 
1658. Cleavage of the RD polypeptide after Arg 740 

30 generates a twochain factor VIII molecule with a 

mature heavy f^ hgi n calciumbridged to a 59 light ch a in , 
i.e. a light chain lacking the first 9 amino-terminal 
amino acids . 

K. CONSTRUCTION OF ARG 740-SER 1657 
35 (ABBREVIAIED RSD DELETION) 

In this section, we d^onstrate how we 
created the RSD deletion which removes the DNA 
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sequence coding on expression for Ser 741 to Gin 
1656 of the mature polypeptide. Initially, we con- 
structed plasmid 411. Bel l and lineeurized it with 
Bel l as described in example "J". Subsequently, we 
5 digested plasmid QD with Hindlll, cleaving the plasmid 
between the codons for Arg 740 and Ser .741 and within 
the cpdon for Glu 321 to generate a 1258 bp fragment. 
We preserved the AGC codon within the 5' AGCT over- 
hang with Klenow enzyme and dATP, dfUS and dCTP and 

10 then renoved the leftover 5 ' T overhang .with mimg 

bean nuclease. We then ligated this modified HindX II 
fragment to Bcll-linearized 411 .Bell, which had been 
previously treated with Klenow enzyme and all four 
deo^On^neleoside triphosphates, to produce plasmid 

15 RSD.411, which contains an Asp7 18 site 5' to the 

fusion site within the 1258 bp ffind lll fragment and 
a PstI site 3' to the fusion site within the 411 bp ■ 
Sau3 A fragment. 

We then prepared Asp7 18 -cieaved , Pst I- 

20 partially cleaved RE plasmid DNA as described in 
example "D" . Subsequently, we cleaved plasmid 
RSD.411 with Asp7 18 and PstI and ligated the result- 
ing 604 bp fragment spanning the RSD fusion to the 
Asp 718 -cleaved, PstI -partially cleaved RE plasmid 

25 DMA to generate plasmid RSD. l%)Qn-^6±pression, the 
RSD plasmid encoded a factor VIII polypeptide with a 
fusion between Arg 740 and Ser 1657. A cleavage of 
RSD polypeptide after Arg 740 generates a 2-chain 
factor VIII molecule with a mature heavy chain and a 

30 delta 8 light chain, i.e. a light chain lacking the 
first eight amino terminal amino acids. Furthermore, 
because in the primary translation product Ser is 
also at position 741, RSD may also be viewed as a 
fusion between Ser 74i and Asp 1658. A cleavage 

35 after Ser 741 may generate a 2-chain factor VIII 
molecule with a heavy chain terminating at Ser 741 
and a 69 light chain. 
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L- TRANSFECTION OF AFRICAN 

CatEEN MOMKEY KIDNEY CELLS 

We first produced African green monkey 
kidney cell line 6L by cotransfecting cell line BSC40 
5 (BSCl African green monkey kidney cells which have 
been adapted to grow at 40 ^'C), [W* Brackman and 
D. Nathan^ Proc. Natl, Acad. Sci. USA , 71, pp. 942-46 
(1974)] with pLTRtsASS and with pY3, which has a 
transcription unit for hygromycin B phosphotranferase 

10 [K. Blochlinger, and A. Diggelmann^ Mol. Cell Biol. 
4, p. 2929-31 (1984)]. Plasmid LTRtsASS contains 
a transcription unit for a temperature sensitive 
SV40 T-antigen allele. A mutant tsA58 virus is a 
temperature-sensitive mutant of SV40 which does not 

15 produce progeny at 39*C. The large T-antigen protein 
specified by the tsA58 mutant is much more labile at 
the noi5>ermissive taooperature than wild type large 
T-antigen protein [H. Tegtmeyer et al*, J. Virol 
16, pp. 168-78 (1975). The resulting cell line 6L 

20 inducibly expresses SV40 T-antigen at 33**C. 

We then transfected 6L cells with stq)er- 
coiled expression plasmids RD or RSD. The transfec- 
tion was carried out using the DEAE-dextran technique 
and chloroquine as described in Example "F". We 

25 then incubated the transfected cells at 33**C. During 
incubation, the transfected cells synthesized and 
secreted modified factor VIII :C into the culture 
fluid. The transfectants will secrete modified factor 
.VIII:C for up to 120 hours. For most assays, the 

30 cmVml ratio was approximately 5.5; that is, a con- 
fluent monolayer of 6L transfectants in a 100mm Petri 
dish (55cm^) was covered with 10ml culture fluid. 

M. FACTOR VIII ;C ACTIVITY ASSAY 

We assayed the RE (Exan^le D), RD and RSD 
35 expression constructs for factor VIII :C production 
after trans fection and incubation at 33 ^'C for three 
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days using KabiVitrum's Coatest® factor VIII assay 
kit adapted to a 96 well plate. Cells transfected 
with plasmid RE produced culture fluid having a factor 
VIII concentration whidi was 0.48% plasma level 
5 [normal plasma factor VIII concentration is approxi- 
mately 150 ng/mll. Cells transfected with plasmid 
KD produced culture fluid having a factor VIII con- 
c^tration which was 0.41% plasma level. Cells trans- 
fected with pleismid RSD produced culture fluid haying 
10 a factor VIII concentration which was 0.71% plasma 
level. 

In a similar assay, cells transfected with 
plasmids BE or RSD which had been incubated at 33 ^C 
for three days and then for an additional two days, 
15 yielded the following factor VIII concentrations in 
the cell culture fluidr 

Factor VIII rC Concentration In Culture 
Fluid As % Of Plasma Level 

3 Days , 5 Days 

20 RE Transfected Cells 0.30% . ^ 0.77% 

RSD Transfected Cells 1^50% 1-16% 

Microorganisms, recombinant DBA molecules 
and the modified factor VIII:C DNA coding sequences 
of this invention are exeniplified by a culture depo- 
25 sited in the culture collection of the American Tjfpe 
Culture Collection, in Rockville, Maryland, on 
July 22, 1986, and identified there as: 

E.coli HBlOl (RE) ' 

This culture was assigned ATCC accession number 53517. 
30 Two additional cultures were deposited in the American 
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Type Culture Collection^ in Rockville, Maryland on 
July 27, 1987, and identified there as: 

Ad^HD.2 [ E,coli HBlOl (RD)], having ATCC 
accession number 67475; and Ad*RSD.1.2 [ E>coli HBlOl 
5 (RSD)], having ATCC accession number 67476. 

While we have hereinbefore presented a 
number of embodiments of this invention, it is 
apparent that our basic construction can be altered 
to provide other embodiments which utilize the pro- 
10 cesses and compositions of this invention. Therefore, 
it will be appreciated that the scope of this inven- 
tion is to be defined by the claims appended hereto 
rather than by the specific embodiments which have 
been presented hereinbefore by way of example. 
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Vfe claim: 

1. A recombinant DNA molecule Gharacter- 
ized by a DBA sequence co(£ing on egression for a 
modified factor VIII:C-like polypeptide, said DMA 

5 sequence containing a deletion of a major part of 
the DNA sequence v^ch, codes on egression for the 
maturation polypeptide of factor VIII :C. 

2. Ihe recombinant DNA molecule according 
to claim 1, wherein the deletion is all of the DNA 

10 sequence which codes on expression for the maturation 
polypeptide of factor VIII:C. 

3. The recombinant DNA molecule according 
to claim. 1, wherein the DNA sequence coding on 
egression for the modified factor VIII:C-like poly- 

15 peptide is selected from the grot^ consisting of: 

AI6 GCC ACC AGA AGA TAG TAG GT6 GGT GCA GTG GAA GIG 
TGA TG6 GAG ZAZ AZG GAA AGI GAT GTG GGT GAG GTG GCT 
CTG GAG. GCA AGA TTT GCT GGT AGA CT6 GCA AAA TCT TTT 
CCA TTG AAC AGG TGA GTG GTG TAC AAA AAG ACT GTG TTT 

20 GTA GAA TTC AGG GAT CAC CTT TTG AAG ATG GCT AAG CCA 
AGG CCA GCC TG6 ATG GGT CTG CTA GGT CCT ACC ATG GAG 
GCT GAG GTT TAT GAT ACA GTG GTC ATT AGA CTT AAG AAC 
ATG GCT TCC CAT CCT CTC ACT CTT CAT GCT GTT GGT GTA 
TCC TAC TGG AAA GCT TCT GAG GGA GCT GAA TAT GAT GAT 

25 GAG ACC AGT CAA AGG GAG AAA GAA GAT GAT AAA GTC TTC ' 
CCT GGT GGA AGC GAT ACA TAT GTC TGG GAG GTC CTG AAA 
GAG AAT GGT CCA ATG GCC TCT GAC CCA CTG TGC CTT ACC 
TAG TGA TAT CTT TCT CAT GTG GAC CTG GSIA AAA GAC TTG 
AAT TGA GGC CTC ATT GGA GCC CTA CTA GTA TGT AGA GAA 

30 GGG AGT CEG GCC AAG GAA AAG ACA CAC ACC TTG CAC AAA 
TTT ATA CTA CTT TTT GCT GTA TTT GAT GAA GGG AAA AGS 
TGG CAC TCA GAA ACA AAG^AAC TCC TTG ATG CAG GAT AGG 
GAT GCT GCA TCT GCT CGG GCC TGG CCT AAA ATG CAC ACA 
GTC AAT GCT TAT 7TA AAC AGG TCT CTG(CTA) CCA GCT CTG 

35 AIT GGA TGC CAC AGG AAA TCA GTC TAT TGG CAT GTG ATT 
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GGA AT6 GGC ACC ACT CCT GAA 
GAA GGT CAC ACA TTT CTT GTG 
TCC TTG GAA ATC TCG CCA ATA 
ACA CTC TTG ATG GAC CTT G6A 
5 CAT ATC TCT TCC CAC CAA CAT 
GTC AAA GIA GAC AGC TGT CCA 
ATG AAA AAT AAT GAA GAA GCG 
CTT ACT GAT TCT GAA ATG GAT 
GAC AAC TCT CCT TCC TTT ATC 

10 AA6 AAG CAT CCT AAA ACT TGG 
GAA GAG GAG GAC TGG GAC TAT 
CCC GAT GAC AGA AGT TAT AAA 
GGC CCT CAG CG6 ATT GGT AGG 
TTT ATG GCA TAC ACA GAT GAA 

15 GCT ATT CAG CAT GAA TCA GGA 
TAT GGG GAA GTT GGA GAC ACA 
AAT CAA GCA AGC AGA CCA TAT 
ATC ACT GAT GTC CGT CCT TTG 
AAA GGT GTA AAA CAT TTG AAG 

20 GGA GAA ATA TTC AAA TAT AAA 
GAT GGG CCA ACT AAA TCA GAT 
TAT TAC TCT ACT TTC GTT AAT 
TCA GGA CTC ATT GGC CCT CTC 
TCT CTA GAT CAA AGA GGA AAC 

25 AGG AAT GTC ATC CTG TTT TCT 
AGC TGG TAC CTC ACA GAG AAT 
AAT CCA GCT GGA GTG CAG CTT 
GCC TCC AAC ATC ATG CAC AGC 
GAT AGT TTG CAG TTG TCA GTT 

30 TAC TGG TAC ATT CTA AGC ATT 
CTT TCT GTC TTC TTC TCT GGA 
ATG GTC TAT GAA GAC ACA CTC 
GGA GAA ACT CTC TTC ATG TCG 
TGG ATT CTG GGG TGC CAC AAC 

35 GGC ATG ACC GCC TTA CTG AAG 
AAC ACT GCT GAT TAT TAC GAG 
TCA GCA TAC TTG CTG ACT AAA 
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GTG 

CAC 

TCA 

ATA 

TTC 

CTC 

AGG AAC 

CAT 

CGC 

CAG 

GCG 

ACT 

TTC 

CTT 

ACT 

GCT 

CAA 

CAG TTT 

CTA 

CTG 

TTT 

TCT 

^ At* 


ATG 

GAA 

GCT 

TAT 


GAA 

CCC 

CAA 

CTA 

CGA 

GaA 

GAC 

TAT 

GAT 

GAT 

GAT 

CTG 

CTC 

AGG 

TTT 

GAT 

GAT 

CAA 

ATT 

CGC 

TCA 

GTT 

GCC 

CTA 

CAT 

TAC 

ATT 

GCT 

GCT 

GCT 

CCC 

TTA 

GTC 

CTC 

GCC 

ACT 

CAA 

TAT 

TTG 

AAC 

AAT 

AAG 

TAC 

AAA 

AAA 

GTC 

CGA 

ACC 

TTT 

AAG 

ACT 

CGT 

GAA 

ATC 

TTG GGA 

CCT TTA 

CTT 

CTG 

TTG ATT 

ATA 

TTT 

AAG 

AAC 

ATC 

TAC 

CCT 

CAC 

GGA 

TAT 

TCA AGG 

AGA 

TTA 

CCA 

GAT 

TTT 

CCA 

ATT 

CTG 

CCA 

TGG ACA 

GTG 

ACT 

GTA 

GAA 

CCT 

CGG 

TGC 

CTG- ACC 

CGC 

ATG 

GAG AGA 

GAT 

CTA 

GCT 

CTC 

ATC 

TGC 

TAC 

AAA 

GAA 

CAG 

ATA 

ATG 

TCA 

GAC 

AAG 

GTA 

TTT 

GAT 

GAG 

AAC 

CGA 

ATA 

CAA 

CGC 

TTT 

CTC 

CCC 

GAG 

GAT 

CCA 

GAG 

TTC 

CAA 

ATC 

AAT 

GGC 

TAT 

CTT 

TTT 

TCT 

TTG 

CAT 

GAG 

CTG 

GCA 

GGA 

GCA 

CAG 

ACT 

GAC 

TTC 

TAT 

ACC 

TTC 

AAA 

CAC 

AAA 

ACC 

CTA 

TTC 

CCA 

TTC 

TCA 

ATG 

GAA 

AAC 

CCA 

GCT 

CTA 

TCA 

GAC 

TTT 

CGG 

AAC 

AGA 

CTT 

TCT 

ACT 

TGI 

GAC 

AAG 

GAC 

ACT 

TAT 

GAA 

GAT 

ATT 

AAC 

AAT 

GCC 

ATT 

GAA 

CCA 
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AGA 

AGC 

TTC 

TCC 

CAG 

GAT 


TAT 

GGT 

ACT 

CAG 

ATA 

CCA 


GAG 

AAG 

TCA 

CCA 

GAA 

AAA 


ACC 

ATT 

TTG 

TCC 

CTG 

AAC 

5 _ 

AXA 

GCA 

GCA 

ATA 

AAT 

GAG 


GAA 

GTC 

ACC 

TGG 

GCA 

AAG 


TGC 

TCT 

CAA 

AAC 

CCA 

CCA 


GftA 

ATA 

ACT 

CGT 

ACT. 

ACT 


AIT 

GAC 

TAT 

GAT 

GAT 

ACC 

10 

GAA 

GAT 

TTT 

GAC 

ATT 

TAT 


CCC 

CGC 

AGC 

TTT 

CAA 

AAG 


GCT 

GCA 

GTG 

GAG 

AGG 

CTC 


TCC 

CCA 

CAT 

GTT 

CTA 

AGA 


CTC 

CCT 

CA6 

TTC 

AAG 

AAA 

15 

GAT 

GGC 

TCC 

TTT 

ACT 

CAG 


AAT 

GAA 

CAT 

TTG 

GGUl 

CTC 


GAA 

GTT 

GAA 

GAT 

AAT 

ATC 


GCC 

TCT 

CGT 

CCC 

TAT 

TCC 


TAT 

GAG 

GAA 

GAT 

CAG 

AGG 

20 

AAC 

TTT 

GTC 

AAG 

CCT 

AAT 


AAA 

GTG 

CAA 

CAT 

CAT 

ATG 


GAC 

TGC 

AAA 

GCC 

TGG 

GCT 


GAA 

AAA 

GAT 

GTG 

CAC 

TCA 


GTC 

TGC 

CAC 

ACT 

AAC 

ACA 

25 

CAA 

GTG 

ACA 

GTA 

CAG 

GAA 


ATC 

TTT 

GAT 

GAG 

ACC 

AAA 


ATG 

GAA 

AGA 

AAC 

TGC 

AGG 


GAA 

GAT 

CCC 

ACT 

TTT 

AAA 


ATC 

AAT 

GGC 

TAG 

ATA 

ATG 

30 

ATG 

GCT 

CAG 

GAT 

CAA 

AGG 


ATG 

GGC 

AGC 

AAT 

GAA 

AAC 


GGA 

CAT 

GTG 

TTC 

ACT 

CTA 


ATG 

GCA 

CTG 

TAC 

AAT 

CTC 


GTG 

GAA 

ATG 

TTA 

CCA 

TCC 

35 

GAA 

TGC 

CTT 

ATT 

GGC 

GAG 


ACA 

CTT 

TTT 

CTG 

GTG 

TAC 


CTG 

GGA 

ATG 

GCT 

TCT 

GGA 


■40 


CCT 

CTT 

GCT 

TGG 

GAT 

AAC 

CAC 

AAA 

GAA 

GAG 

TGG 

AAA 

TCC 

CAA 

ACA 

GCT 

TTT 

AAG 

AAA 

AAG G|AT 

GCT 

TCT 

GAA 

AGC 

AAT 

CAT 

GCA 

GGA 

CAA 

AAT 

AAG 

CCC 

GAA ATA 

CAA 

GCT 

AGG 

ACT 

GAA 

AGG 

CTG 

GTC 

TTG 

AAA 

CGC 

CAT 

CAA 

CGG 

CTT 

CAG 

TCA 

GAT 

CAA 

GAG 

GAA 

ATA 

TCA 

GTT 

GAA 

ATG 

AAG 

AAG 

GAT 

GAG 

GAT 

GAA 

AAT 

CAG 

AGC 

AAA 

ACA 

CGA 

CAC 

TAT 

TTT 

ATT 

TGG 

GAT 

TAT 

GGG 

ATG 

ACT 

AGC 

AAC 

AGG 

GCT 

CAG 

ACT 

GGC 

ACT 

GTT 

CTT 

TTC 

CAG 

GAA 

TTT 

^CT 

CCC 

TTA 

TAC 

CCT 

GGA 

GAA 

CTA 

CTG 

GGG 

CCA 

TAT 

ATA 

AGA 

GCA 

ATG 

GTA 

ACT 

TTC 

AGA 

AAT 

CAG 

TTC 

TAT 

TCT 

AGC 

CTT 

AIT 

TCT 

CAA 

GGA 

GCA 

GAA 

CCT 

AGA 

AAA 

GAA 

ACC 

AAA 

ACT 

TAC 

TTT 

TGG 

GCA 

CCC 

ACT 

AAA 

GAT 

GAG 

TTT 

TAT 

TTC 

TCT 

GAT 

CTC 

GAC 

CTG 

GGC 

CTG 

ATT 

GGA 

CCC 

CTT 

CTG 

CTG 

AAC 

CCT 

GCT 

CAT 

GGG 

AGA 

TTT 

GCT 

CTG 

TTT 

TTC (CTC) ACC 

AGC 

TGG 

TAC 

TTC 

ACT GAA AAT 

GCT 

CCC 

TGC 

AAT 

ATC 

CAG ATG 

GAG 

AAT 

TAT 

CGC 

TTC. 

CAT GCA 

GAT 

ACA 

CTA 

CCT 

GGC 

TTA GTA 

ATT 

CGA 

TGG 

TAT 

CTG CTC AGC 

ATC 

CAT 

TCT 

ATT 

CAT 

TTC 

ACT 

CGA 

AAA 

AAA 

GAG 

GAG 

TAT AAA 

TAT 

CCA 

GCT 

CTT 

TTT 

GAG ACA 

AAA 

GCT 

GGA 

AIT 

TGG CGG 

GIG 

CAT 

CTA 

CAT 

GCT 

GGG ATG AGC 

AGC 

AAT 

AAG 

TCT 

CAG 

ACT CCC 

CAC 

ATT 

AGA 

GAT 

TTT 

CAG ATT 
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ACA GCT TCA GGA CAA TAT GGA CAG TGG GCC CCA AAG CTG 
GCC AGA CTT CAT TAT TCC GGA TCA AIC AAT GCC TGG AGC 
ACC AAG GAG CCC TTT TCT TGG ATC AAG GTG GAT CTG TTG 
GCA CCA ATG AIT AIT CAC GGC ATC AAG ACC CAG GGT GCC 
5 CGT CAG AAG TTC TCC AGC CTC TAC ATC TCT CAG TTT ATC 
ATC ATG TAT ACT CTT GAT GGG AAG AAG TGG CAG ACT TAT 
CGA GGA AAT TCC ACT GGA ACC TTA ATG CTC TTC TTT GGC 
AAT CTG GAT TCA TCT GGG ATA AAA CAC AAT AIT TTT AAC 
CCT CCA AIT ATT GCT CGA TAC ATC CCT TTG CAC CCA ACT 

10 CAT TAT AGC AIT CGC AGC ACT CTT CGC ATG GAG TTG ATG 
GGC TCT GAT TTA AAT ACT TGC AGC ATG CCA TTG GGA ATG 
GAG ACT AAA GCA AIA TCA GAT GCA CAG ATT ACT GCT TCA 
TCC TAC TTT ACC AAT ATG TTT GCC ACC TGG TCT CCT TCA 
AAA GCT CGA CTT CAC CTC CAA GGG AGG ACT AAT GCC TGG 

15 AGA CCT CAG GTG AAT AAT CCA AAA GAG TGG CTG CAA GTG 
GAG TTC CAG AAG ACA ATG AAA CTC ACA GGA STA ACT ACT 
CAG GGA GTA AAA TCT CTG CTT ACC AGC ATG TAT CTG AAG 
GAG TTC CTC ATC TCC AGC AGT CAA GAT GGC CAT CAG TGG. 
ACT CTC TTT TTT CAG AAT GGC AAA CTA AAG CTT TTT CAG 

20 GGA AAT CAA GAC TCC TTC ACA CCT CTG CTG AAC TCT CTA 
GAC CCA CCG TTA CTG ACT CGC TAC CTT CGA ATT CAC CCC 
CAG ACT TGG CTG CAC CAG ATT GCC CTG AGG ATG GAG GTT 
CTG GGC TGC GAG GCA CAG GAC CTC TAC; 
GCC ACC AGA AGA TAC TAC CTG GGT GCA GTG GAA CTG TCA 

25 TGG GAC TAT ATG CAA ACT GAT CTC GCT GAG CTG CCT GTG 
GAC GCA AGA TTT CCT CCT AGA CTG CCA AAA TCT TTT CCA 
TTC AAC ACC TCA CTC CTG TAC AAA AAG ACT CTG TTT GTA 
GAA TTC ACG GAT CAC CTT TTC AAC ATC GCT AAG CCA AGG 
CCA CCC TGG ATG GCT CTG CTA GCT CCT ACC ATC CAG GCT 

30 GAG GTT TAT GAT ACA CTG CTC ATT ACA CTT AAG AAC ATG 
GCT TCC CAT CCT CTC ACT CTT CAT GCT GTT GCT CTA TCC 
TAC TGG AAA GCT TCT GAG GGA GCT GAA TAT GAT GAT CAG 
ACC ACT CAA AGG GAG AAA GAA GAT GAT AAA CTC TTC CCT 
GCT GGA AGC CAT ACA TAT CTC TGG CAG CTC CTG AAA GAG 

35 AAT GCT CCA ATG GCC TCT GAC CCA CTG TGC CTT ACC TAC 
TCA TAT CTT TCT CAT CTG GAC CTG GTA AAA GAC TTG AAT 
TCA GGC CTC ATT GGA GCC CTA CTA GTA TCT AGA GAA GGG 


wo 88/00831 


PCr/US87/»l8l4 


-42- 

ACT CTG GCC AAG GAA AAG ACA CAC ACC TTG CAC AAA TIT 
ATA CIA CTT TIT GCT GTA TIT GAT GAA GGG AAA ACT TGG 
CAC TCA GAA ACA AAG AAC TCC TTG ATG CAG GAT AGG GAT 
GCT GCA TCT GCT CGG GCC TGG CCT AAA ATG CAC ACA CTC 
5 AAT GCT TAT CTA AAC AGG TCT CTG(CTA) CCA GCT CTG ATT 
GGA TGC CAC AGG AAA TCA CTC TAT TGG CAT CTG ATT GGA 
ATG GGC ACC ACT CCT GAA CTG CAC TCA ATA TTC CTC GAA 
GCT CAC ACA TTT CTT GTG AGG AAC CAT CGC CAG GCG TCC 
TTG GAA ATC TCG CCA" ATA ACT TTC CTT ACT GCT CAA ACA 

10 CTC TTG ATG GAC CTT GGA CAG TTT CTA CTG TTT TCT CAT 
ATC TCT TCC CAC CAA CAT GAT GGC ATG GAA GCT TAT CTC 
AAA GTA GAC AGC TCT GCA GAG GAA CCC CAA CTA CGA ATG 
AAA AAT AAT GAA GAA GCG GAA GAC TAT GAT GAT GAT CTT 
ACT GAT TCT GAA ATG GAT CTG CTC AGG TTT GAT GAT GAC 

15 AAC TCT CCT TCC TTT ATC CAA ATT CGC TCA CTT GCC AAG 
AAG CAT CCT AAA ACT TGG CTA CAT TAC ATT GCT GCT GAA 
GAG GAG GAC TGG GAC TAT GCT CCC TTA CTC CTC GCC CCC 
GAT GAC AGA ACT TAT AAA ACT CAA TAT TTG AAC AAT 'GC 
CCT CAG CGG AIT GCT AGG AAG TAC AAA AAA CTC CGA TTT 

20 ATG GCA TAC ACA GAT GAA ACC TTT AAG ACT CCT GAA GCT 
ATT CAG CAT GAA TCA GGA ATC TTG GGA CCT TTA CTT TAT 
GGG GAA CTT GGA GAC ACA CTG TTG ATT ATA TIT AAG AAT 
CAA GCA AGC AGA CCA TAT AAC ATC TAC CCT CAC GGA ATC 
ACT GM CTC CCT CCT TTG TAT TCA AGG AGA TTA CCA AAA 

25 GCT GTA AAA CAT TTG AAG GAT TTT CCA AIT CTG CCA GGA 
GAA ATA TTC AAA TAT AAA TGG ACA GTG ACT CTA GAA G^ 
GGG CCA ACT AAA TCA GAT CCT CGG TGC CTG ACC CGC TAT 
TAC TCT ACT TTC CTT AAT ATG GAG AGA GAT CTA GCT TCA 
GGA CTC ATT GGC CCT CTC CTC ATC TGC TAC AAA GAA TCT 

30 GTA GAT CAA AGA GGA AAC CAG ATA ATG TCA GAC AAG AGG 
AAT CTC ATC CTG TTT TCT CTA TTT GAT GAG AAC CGA AGC 
TGG TAC CTC ACA GAG AAT ATA CAA CGC TTT CTC CCC AAT 
CCA GCT GGA GTG CAG CTT GAG GAT CCA GAG TTC CAA GCC 
TCC AAC ATC ATG CAC AGC ATC AAT GGC TAT GIT TIT GAT 

35 ACT TTG CAG TTG TCA CTT TCT TIG CAT GAG CTG GCA TAC 
TGG TAC ATT CTA AGC AIT (3GA GCA CAG ACT GAC TTC CTT 
TCT CTC TTC TTC TCT GGA TAT ACC TTC AAA ZAC AAA ATG 
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CAT GIG TTC ACT CTA. AAA . AAA GAG GAG XAT AAA ATG 

GCA CTG TAC AAT CTC TAT CCA GGT CTT TIT GAG ACA CTG 
GAA ATG TTA CCA TCC AAA GCT GGA AIT TG6 CGG (S£G GAA 
TGC CTT ATT GGC GAG CAT CTA CAT GCT GGG ATG AGC ACA 
5 CTT TTT CTG GTG TAC AGC AM AAG TGI CAG ACT CCC CTG 
GGA ATG GCT TCT GGA CAC ATT AGA GAT TTT CAG ATT ACA 
GCT TCA GGA CAA TAT GGA CAG TGG GCC CCA AAG CTG GCC 
AGA CTT CAT TAT TCC GGA TCA ATC AAT GCC TGG AGC ACC 
AAG GAG CCC TTT TCT TGG ATC AAG CTG GAT CTG TTG GCA 

10 CCA ATG ATT ATT CAC GGC ATC AAG ACC CAG GGT GCC CGT 
CAG AAG TTC TCC AGC CTC TAC ATC TCT CAG TTT ATC ATC 
ATG TAT AGT CTT GAT GGG AAG AAG TGG CAG ACT TAT CGA 
GGA AAT TCC ACT GGA ACC TTA ATG GTC TTC TTT GGC AAX 
GIG GAT TCA TCT GGG ATA AAA CAC AAT ATT TTT AAC CCT 

15 CCA ATT ATT GCT CGA TAC ATC CGT TTG CAC CCA ACT CAT 
TI)!r AGC ATT CGC AGC ACT CTT CGC ATG GAG TTG ATG GGC 
TGT GAT TTA AAT AGT TGC AGC ATG CCA TTG GGA ATG GAG 
AGT AAA GCA ATA TCA GAT GCA CAG ATT ACT GCT TCA TCC 
TAC TTT ACC AAT ATG TTT GCC ACC TGG TCT CCT TCA AAA 

20 GCT CGA CTT CAC CTC CAA GGG AGG AGT AAT GCC TGG AGA 
CCT CAG GTG AAT . AAT CCA AAA GAG TGG CTG CAA GTG GAC 
TTC CAG AAG ACA ATG AAA GTC ACA GGA GTA ACT ACT CAG 
GGA GTA AAA TCT CTG CTT ACC AGC ATG TAT GTG AAG GAG 
TTC CTC ATC TCC AGC AGT CAA GAT GGC CAT CAG TGG ACT 

25 CTC TTT TTT CAG AAT GGC AAA GTA AAG GKET TTT CAG GGA 
AAT CAA GAC TCC TTC ACA CCT GTG GTG AAC TCT CTA GAC 
CCA CCG TTA CTG ACT CGC TAC CTT CGA ATT CAC CCC CAG 
AGT TGG GTG CAC CAG ATT GCC CTG AGG ATG GAG GEET CTG 
GGC TGC GAG GCA CAG GAC CTC TAC; 

30 ATG GCC ACC AGA AGA TAC TAC CTG GGT GCA GTG GAA CTG 
TCA TGG GAC TAT ATG CAA AGT GAT CTC GCT GAG CTG CCT 
GTG GAC GCA AGA TTT CCT CCT AGA GTG CCA AAA TCT TTT 
CCA TTC AAC ACC TCA GTC CTG TAC AAA AAG ACT CTG TTT 
GTA GAA TTC ACG GAT CAC CTT TTC AAC ATC GCT AAG CCA 

35 AGG CCA CCC TGG ATG GCT CTG CTA GCT CCT ACC ATC CAG 
GCT GAG GTT TAT GAT ACA GTG CTC ATT ACA CTT AAG AAC 
ATG GCT TCC CAT CCT GTC ACT CTT CAT GCT GTT GCT CTA 
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AGC TGG TAG CTC ACA GAG AAT ATA CAA -CGC TIT CTC CCC 
AAT CCA GCT GGA GTG CAG CET GAG GAT CCA GAG TTC CAA 
GCC TCC AAC AIC AIG CAC AGC ATC AAT GGC TAT CTT TTT 
GAT ACT TTG CAG TTG TCA GKET TCT TTG CAT GAG GTG GCA 
5 TAG TGG TAG ATT CTA AGC ATT GGA GCA CAG ACT GAC TTC 
CTT TCT GIG TTG TTC TCT GGA TAT ACC TTC AAA CAC AAA 
ATG' GTG TAT GAA GAC ACA CTC ACC CTA TTC CCA TTG TCA 
GGA GAA ACT GTG TTC ATG TGG ATG GAA AAC GCA GGT CTA 
TGG ATT GTG GGG TGC CAC AAC TCA GAC TTT CGG AAC AGA 

10 GGC ATG AGC GCC TTA GTG AAG GET TCT ACST TGT GAC AAG 
AAC ACT GGT GAT TAT TAG GAG GAC ACT TAT GAA GAT AIT 
TCA GGA TAG TTG GTG ACT AAA AAG AAT GGG AIT GAA CCA 
AGA GAA ATA ACT GCT ACT ACT CUT CAG TCA GAT GAA GAG 
GAA AIT GAC TAT GAT GAT ACC ATA TCA CTT GAA ATG AAG 

15 AAG GAA GAT TTT GAC AIT TAT GAT GAG GAT GAA AAT CAG 
AGC CCC CGC AGC TTT CAA AAG AAA ACA GGA CAC TAT TTT 
ATT GCT GGA CTG GAG AGG CTC TGG GAT TAT GGG ATG ACT 
AGC TCC CCA CAT CTT CTA AGA AAC AGG GGT GAG ACT GGC 
ACT CTC GCT CAG TTC AAG AAA CTT CTT TTC CAG GAA TTT 

20 ACT GAT GGC TCC TIT ACT CAG CCC TTA TAG GCT GGA GAA 
CTA AAT GAA CAT TTG GGA CTC CTG GGG CCA TAT ATA AGA 
GGA GAA CTT GAA GAT AAT ATC ATC GTA ACT TTC AGA AAT 
CAG GCC TCT GCT CGC TAT TGC TTC TAT TCT AGC CTT AIT 
TCT TAT GAG GAA GAT CAG AGG CAA GGA GCA GAA GCT AGA 

25 AAA AAC TTT CTG AAG CCT AAT GAA ACC AAA ACT TAG TTT 
TGG AAA GTG CAA CAT CAT ATG GCA CGC ACT AAA GAT GAG 
TTT GAC TGC AAA GGG TGG GCT TAT TTC TCT GAT CTT GAG 
CTG GAA AAA GAT CTG CAC TCA GGC CTG AOS GGA CCC CTT 
CTG CTC TGG CAC ACT AAG ACA CTG AAC CCT GCT CAT GGG 

30 AGA CAA GTG ACA CTA CAG GAA TTT GCT CTG TTT TTG(CTC) 
ACC 2MC TTT GAT GAG ACC AAA AGG TGG TAG TTG ACT GAA 
AAT ATG GAA AGA AAC TGC AGG GCT GCC TGG AAT ATC CAG 
ATG GAA GAT GCC ACT TTT AAA GAG AAT TAT CGC TTG GAT 
GCA ATC AAT GGC TAG MA ATG GAT ACA GTA CCT GGC TTA 

35 GTA ATG GCT GAG GAT CAA AGG AIT GGA TGG TAT CTG CTC 
AGC ATG GGC AGC AAT GAA AAC AIC CAT TCT ATT CAT TTC 
ACT GGA CAT CTG TTC ACT CTA GGA AAA AAA GAG GAG TAT 
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GCT 

<»:a 

GTG GAA 

CTG 

TCA 

30 

TGG 

GAC 

TAT 

ATG 

Mik Ik 

CAA 

ACtX 

GAX 

CTC 

GGT 

GAG CTG 

CCT 

GTG 


GAC 

GCA 

AGA 

TTT 

CCT 

CCT 

AGA 

GTG 

CCA 

AAA TCT 

TTT 

CCA 


TTC 

AAC 

ACC 

TCA 

CTC 

GTG 

TAC 

AAA 

AAG 

ACT CTG 

TTT 

GTA 


GAA 

TTC 

ACG 

GAT 

CAC 

CTT 

TTC 

AAC 

ATC 

GCT AAG CCA 

AGG 


CCA 

CCC 

TGG 

ATG 

GCT 

CTG 

CTA 

GCT 

CCT 

ACC ATC 

CAG 

GCT 

35 

GAG 

GTT 

TAT 

GAT 

ACA 

CTG 

CTC 

ATT 

ACA 

CTT AAG AAC 

ATG 


GCT 

TCC 

CAT 

CCT 

CTC 

ACT 

CTT 

CAT 

GCT 

GTT GST 

GTA 

TCC 
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T&C 

TGG 

AAA 

CCT 

TCT 

GAG GGA 

GCT 

GAA 

TAT 

GAT 

GAT 

CAG 


ACC 

AST 

CAA 

AGG 

GAG 

AAA GAA 

GAT 

GAT 

AAA 

CTC 

TTC 

CCT 


GGT 

GGA 

AGC 

CAT 

ACA 

TAT GTC 

TGG 

CAG 

GTC 

CTG AAA 

GAG 


AAT 

GGT 

CCA 

ATG 

GCC 

TCT GAC 

CCA 

CTG 

TGC 

CTT 

ACC 

TAC 

5 

TCA 

TAT 

CTT 

TCT 

CAT 

SEG GAC 

CTG GTA AAA GAC 

TTG 

AAT 


TCA 

GGC 

CTC 

ATT 

GGA 

GCC CTA 

CTA GTA 

TCT 

AGA 

GAA 

GGG 


AG! 

CTG 

GGC 

AAG 

GAA 

AAG ACA 

CAC ACC 

TTG 

CAC 

AAA 

TTT 


AXA 

CTA 

CTT 

TTT 

GCT 

CTA "TTT 

GAT 

GAA 

GGG 

AAA ACT 

TGG 


CAC 

TCA 

GAA 

ACA 

AAG 

AAC TCC 

TTG 

ATG 

CAG 

GAT 

AGG 

GAT 

la 

6CT 

GCA 

TCi 

GCT 

CGG 

GCC TGG 

CCT 

AAA ATG CAC 

ACA 

CTC 


AAT 

GGT 

TAT 

GTA 

AAC 

AGG TCT 

CTG(CTA) CCA GCT CTG ATT 


GGA 

TGC 

CAC 

AGG 

AAA 

TCA GTC 

TAT 

TGG 

CAT 

GTG 

ATT 

GGA 


AIG 

GGC 

ACC 

ACT 

CCT 

GAA GTG 

CAC 

TCA 

ATA 

TTC 

CTC 

GAA 


GGT 

CAC 

ACA 

TTT 

CTT 

GTG AGG 

AAC 

CAT 

CGC 

CAG 

GCG 

TCC 

15 

TIG 

GAA 

ATC 

TCG 

CCA 

ATA ACT 

TTC 

CTT 

ACT 

GCT 

CAA 

ACA 


CTC 

TTG 

ATG 

GAC 

CTT 

GGA CAG 

TTT 

CTA 

CTG 

TTT 

TCT 

CAT 


ATC 

TCT 

TCC 

CAC 

CAA 

CAT GAT 

GGC 

ATG 

GAA 

GCT 

TAT 

GTC 


AAA 

GTA 

GAC 

AGC 

TGT 

CCA GAG 

GAA 

CCC 

CAA 

CTA 

CCA 

ATG 


AAA 

AAT 

AAT 

GAA 

GAA 

GCG GAA 

GAC 

TAT 

GAT 

GAT 

GAT 

CTT 

20 

ACT 

GAT 

TCT 

GAA 

ATG 

GAT CTG 

CTC 

AGG 

TTT 

GAT 

GAT 

GAC 


AAC 

TCT 

CCT 

TCC 

TTT 

ATC. CAA 

ATT 

CGC 

TCA 

CTT 

GCC 

AAG 


AA6 

CAT 

CCT 

AAA 

act' TGG GTA 

CAT 

TAC 

ATT 

GCT 

GCT 

GAA 


GAG 

GAG 

GAC 

TGG 

GAC 

TAT GCT 

dec 

TTA 

CTC 

CTC 

GCC 

CCC 


GAT 

GAC 

AGA 

AGT 

TAT AAA AGT 

CAA 

TAT 

TTG 

AAC 

AAT 

GGC 

25 

OCT 

CAG 

CGG 

ATT 

GGT AGG AAG 

TAC 

AAA 

AAA 

GTC 

CGA 

TTT 


ATG 

GCA 

TAC 

ACA 

GAT 

GAA ACC 

TTT 

AAG 

ACT 

CCT 

GAA 

GCT 


ATT 

CAG 

CAT 

GAA 

TCA 

GGA ATC 

TTG 

GGA 

CCT 

TTA 

CTT 

TAT 


G6G 

GAA 

GTT 

GGA 

GAC 

ACA CTG 

TTG 

AIT 

ATA 

TTT 

AAG 

AAT 


CAA 

GCA 

AGC 

AGA 

CCA TAT AAC 

ATC 

TAC 

CCT 

CAC 

GGA 

ATC 

30 

ACT 

GAT 

GTC 

CGT 

CCT 

TTG TAT 

TCA 

AGG 

AGA 

TTA 

CCA 

AAA 


GGT 

GTA 

AAA 

CAT 

TTG AAG GAT 

TTT 

CCA 

ATT 

CTG 

CCA 

GGA 


GAA 

ATA 

TTC 

AAA 

TAT 

AAA TGG 

ACA 

CTG 

ACT 

GTA 

GAA 

GAT 


GGG 

CCA 

ACT 

AAA 

TCA 

GAT CCT 

CGG 

TGC 

CTG 

ACC 

CGC 

TAT 


TAC 

TCT 

AST 

TTC 

GTT 

AAT ATG 

GAG 

AGA 

GAT 

CTA 

GCT 

TCA 

35 

GGA 

CTC 

AIT 

GGC 

CCT 

CTC CTC 

ATC 

TGC 

TAC 

AAA 

GAA 

TCT 


GTA 

GAT 

CAA 

AGA 

GGA 

AAC CAG 

ATA 

ATG 

TCA 

GAC 

AAG 

AGG 


AAT 

GTC 

ATC 

CTG 

TTT 

TCT CTA 

TTT 

GAT 

GAG 

AAC 

CGA 

AGC 
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. TGG 

TAC 

CTC 

ACA 


CCA 

GCT 

GGA 

GIG 


TCC 

AAC 

ATC 

ATG 


AGT 

TTG 

CAG 

TTG 

5 

TGG 

TAC 

ATT 

CCA 


TCT 

GTC 

TTC 

TTC 


CTC 

TAT 

GAA 

GAC 


GAA 

ACT 

GTC 

TTC 


ATT 

CTG 

GGG 

TGC 

10 

ATG 

ACC 

GCC 

TTA 


ACT 

GGT 

GAT 

TAT 


GCA 

TAC 

TTG 

CTG 


GAA 

ATA 

ACT 

CGT 


ATT 

GAC 

TAT 

GAT 

15 

GAA 

GAT 

TTT 

GAC 


CCC 

CGC 

AGC 

TTT 


GCT 

GCA 

GTG 

GAG 


TCC 

CCA 

CAT 

GTT 


GTC 

CCT 

CAG 

TTC 

20 

GAT 

GGC 

TCC 

TTT 


AAT 

GAA 

CAT 

TTG 


GAA 

GTT 

GAA 

GAT 


GCC 

TCT 

CGT 

CCC 


TAT 

GAG 

GAA 

GAT 

25 

AAC 

TTT 

GTC 

AAG 


AAA 

GTG 

CAA 

CAT 


GAC 

TGC 

AAA 

GCC 


GAA 

AAA 

GAT 

GTG 


GTC 

TGC 

CAC 

ACT 

30 

CAA 

GTG 

ACA 

GTA 


ATC 

TTT 

GAT 

GAG 


ATG 

GAA 

AGA 

AAC 


GAA 

GAT 

CCC 

ACT 


ATC 

AAT 

GGC 

TAC 

35 

ATG 

GCT 

CAG 

GAT 


ATG 

GGC 

AGC 

AAT 


GGA 

CAT 

GTG 

TTC 
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GAG 

AAT 

ATA 

CAA 

CGC 

TTT 

CAG 

CTT 

GAG 

GAT 

CCA 

GAG 

CAC 

AGC 

ATC 

AAT 

GGC 

TAT 

TCA 

GTT 

TCT 

TTG 

CAT 

GAG 

AGC 

AIT 

GGA 

GCA 

CAG 

ACT 

TCT 

GGA 

TAT 

ACC 

TTC 

AAA 

ACA 

CTC 

ACC 

CTA 

TTC 

CCA 

ATG 

ICG 

ATG 

GAA 

AAC 

CCA 

CAC 

AAC 

TCA 

GAC 

TTT 

CGG 

CTG 

AAG 

GIT 

TCT 

AGT 

TCT 

TAC 

GAG 

GAC 

AGT 

TAT 

GAA 

AGT 

AAA 

AAC 

AAT 

GCC 

ATT 

ACT 

ACT 

CTT 

CAG 

TCA 

GAT 

GAT 

ACC 

ATA 

TCA 

GTT 

GAA 

ATT 

TAT 

GAT 

GAG 

GAT 

GAA 

CAA 

AAG 

AAA 

ACA 

CGA 

CAC 

AGG 

CTC 

TGG 

GAT 

TAT 

GGG 

CTA 

AGA 

AAC 

AGG 

GCT 

CAG 

AAG 

AAA 

GTT 

GTT 

TTC 

CAG 

ACT 

CAG 

CCC 

TTA 

TAC 

CCT 

GGA 

CTC 

CTG 

GGG 

CCA 

TAT 

AAT 

ATC 

ATG 

GTA 

ACT 

TTC 

TAT 

TCC 

TTC 

TAT 

TCT 

AGC 

CAG 

AGG 

CAA 

GGA 

GCA 

GAA 

CCT 

AAT 

GAA 

ACC 

AAA 

ACT 

CAT 

ATG 

GCA 

CCC 

ACT 

AAA 

TGG 

GCT 

TAT 

TTC 

TCT 

GAT 

CAC 

TCA 

GGC 

CTG 

AIT 

GGA 

AAC 

ACA 

CTG 

AAC 

CCT 

GCT 

CAG 

GAA 

TTT 

GCT 

CTG 

TTT 

ACC 

AAA 

AGC 

TGG 

TAC 

TTC 

TGC 

AGG 

GCT 

CCC 

TGC 

AAT 

TTT 

AAA 

GAG 

AAT 

TAT 

CGC 

ATA 

ATG 

GAT 

ACA 

CTA 

CCT 

CAA 

AGG 

ATT 

CGA 

TGG 

TAT 

GAA 

AAC 

ATC 

CAT 

TCT 

AIT 

ACT 

GTA 

CGA 

AAA 

AAA 

GAG 
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CTC 

CCC 

AAT 

TTC 

CAA 

GCC 

GTT 

TTT 

GAT 

CTG 

GCA 

TAC 

GAC 

TTC 

CTT 

CAC 

AAA 

ATG 

TTC 

TCA 

GGA 

GCT 

CTA 

TGG 

AAC 

AGA 

GGC 

GAC 

AAG 

AAC 

GAT 

ATT 

TCA 

GAA 

CCA 

AGA 

CAA 

GAG 

GAA 

ATG 

AAG 

AAG 

AAT 

CAG 

AGC 

TAT 

TTT 

ATT 

ATG 

ACT 

AGC 

ACT 

GGC 

ACT 

GAA 

TTT 

ACT 

GGA 

GAA 

CTA 

ATA 

AGA 

GCA 

AGA 

AAT 

CAG 

CTT 

ATT 

TCT 

CCT 

AGA 

AAA 

TAC 

TTT 

TGG 

GAT 

GAG 

TTT 

GTT 

GAC 

CTG 

CCC 

CTT 

CTG 

CAT 

GGG 

AGA 

TTC (CTC) ACC 

ACT 

GAA 

AAT 

ATC 

CAG 

ATG 

TTC 

CAT 

GCA 

GGC 

TTA 

GTA 

CTG 

CTC 

AGC 

CAT 

TTC 

ACT 

GAG 

TAT 

AAA 
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AI6 

GCA 

CTG 

TAC 

AAT 

CTC 

TAT 

CCA 

GSC CTT 

TTT 

GAG 

ACA 

GTG 

GAA 

ATG 

TTA 

CCA 

TCC 

AAA 

GCT 

GGA ATT 

TGG 

asG 

GTG 

GAA 

TGC 

CTT 

ATT 

GGC 

GAG 

CAT 

CTA 

CAT GCT GGG ATG 

AGC 

ACA 

CTT 

TTT 

CTG 

GIG 

TAC 

AGC 

AAT 

AAG TGT 

CAG ACT 

CCC 

CIG 

GGA 

ATG 

GCT 

TCI 

GGA 

CAC 

ATT 

AGA GAT TTT 

CAG 

ATT 

ACA 

GCT 

TCA 

GGA 

CAA 

TAT 

GGA 

CAG 

TGG GCC CCA AAG 

CTG 

GCC 

AGA 

CTT 

CAT 

TAT 

TCC 

GGA 

TCA 

ATC AAT 

GCC 

TGG 

AGC 

ACC 

AAG 

GAG 

CCC 

TTT 

TCT 

TGG 

ATC 

AAG GTG 

GAT 

CTG 

TTG 

GCA 

CCA 

ATG 

ATT 

AIT 

CAC 

GGC 

ATC 

AAG. ACC. CBk got 

GCC '-• • 

CGI 

CAG 

AAG 

TTC 

TCC 

AGC 

CTC 

TAC 

ATC TCT. tZSSTXI 

ATC .: 

ATC 

ATG 

TAT 

AGT 

CTT 

GAT 

GGG 

AAG 

AASi.TGSrjQK^'flGS&TSTl. ~- 

CGA 

GGA 

AAT 

TCC 

ACT 

GGA 

ACC 

TTA 

JHJX-GICr.-LTLXir.^TTT 

GQC:-.. 

AAT 

GTG 

GAT 

TCA 

TCT 

GGG 

ATA 

AAA 

CAC AAT" iCTT?. ITT 

AAG 

CCT 

CCA 

ATT 

ATT 

GCT 

CGA 

TAC 

ATC 

CGE TIG 

CAC 

CCA 

ACT 

CAT 

TAT 

AGC 

ATT 

CGC 

AGC 

ACT 

CTT 

CGC ATG 

GAG 

TIG 

ATG 

6GC 

TGT 

GAT 

TTA 

AAT 

AGT 

TGC 

AGC 

ATG CCA 

TIG 

GGA 

ATG 

GAG 

AGT 

AAA 

GCA 

ATA 

TCA 

GAT 

GCA 

CAG ATT 

ACT 

GCT 

TCA 

TCC 

TAG 

TTT 

ACC 

AAT 

ATG 

TTT 

GCC 

ACC TGG 

TCT 

CCT 

TCA 

AAA 

GCT 

CGA 

CTT 

CAC 

CTC 

CAA 

GGG 

AGG AGT 

AAT 

GCC 

TGG 

AGA 

CCT 

CAG 

GTG 

AAT 

AAT 

CCA 

AAA 

GAG TGG 

CTG 

CAA 

GTG 

GAG 

TTC 

CAG 

AAG 

ACA 

ATG 

AAA 

GTC 

ACA GGA 

GTA 

ACT 

ACT 

GAG 

GGA 

GTA 

AAA 

TCT 

CTG 

CTT 

ACC 

AGC ATG 

TAT 

CTG 

AAG 

GAG 

TTC 

CTC 

ATC 

TCC 

AGC 

AGT 

CAA 

GAT GGC 

CAT 

CAG 

TGG 

ACT 

CTC 

TTT 

TTT 

CAG 

AAT 

GGC 

AAA 

GTA AAG 

GIT 

TTT 

CAG 

GGA 

AAT 

CAA 

GAC 

TCC 

TTC 

ACA 

CCT 

GTG GTG 

AAC 

TCT 

CIA 

GAC 

CCA 

CCG 

TTA 

CTG 

ACT 

CGC 

TAC 

CTT CGA 

ATT 

CAC 

CCC 

CAG 

AGT 

TG6 

GIG 

CAC 

CAG 

ATT 

GCC 

CTG AGG 

ATG 

GAG 


CTG 

GGC 

TGC 

GAG 

GCA 

CAG 

GAC 

CTC 

TAC. 





4» The recombinant DNA molecule according 
30 to any one of claims 1-^3, wherein the DNA sequence 
coding on expression for the modified factor VIITrC- 
like polypeptide is operatively linked to an expres- 
sion control sequence. 
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- 5. The recoinbinant DNA molecule according 
to claim 4, wherein the expression control sequence 
is selected from the group consisting of the lac 
system, the trp system, the tac system, the trc 
5 system, major operator and promoter regions of 

phage k, the control region of fd coat protein, the 
early and late promoters of SV40, promoters derived 
from polyoma, adenovirus and simian virus, the 
promoter for 3-phosphoglycerate Icinase or other 
10 glycolytic enzymes, the promoters of yeast acid phos- 
phatase, the promoters of the yeast a -mating factors, 
and other sequences known to control the expression 
of genes of prokaryotic or eukaryotic cells and their 
viruses, or combinations thereof. 

15 6, A modified factor VIII:C-like polypep- 

tide having a formula selected from the group consist- 


ing of: 



met 

ala 

thr arg 

arg 

tyr 

tyr 

leu 

gly 

ala 

val 

glu 

leu 


ser 

trp 

asp tyr 

met 

gin 

ser 

asp 

leu 

gly 

glu 

leu 

pro 

20 

val 

asp 

ala arg 

phe 

pro 

pro 

arg 

val 

pro 

lys 

ser 

phe 


pro 

phe 

asn thr 

ser 

val 

val 

tyr 

lys 

lys 

thr 

leu 

phe 


val 

glu 

phe thr 

asp 

his 

leu 

phe 

asn 

ile 

ala 

lys 

pro 


arg 

pro 

pro trp 

met 

gly 

leu 

leu 

gly 

pro 

thr 

ile 

gin 


ala 

glu 

val tyr 

asp 

thr 

val 

val 

ile 

thr 

leu 

lys 

asn 

25 

met 

ala 

ser his 

pro 

val 

ser 

leu 

his 

ala 

val 

gly 

val 


ser 

tyr 

trp lys 

ala 

ser 

glu 

gly 

ala 

glu 

tyr 

asp 

asp 


gin 

thr 

ser gin 

arg 

glu 

lys 

glu 

asp 

asp 

lys 

val 

phe 


pro 

gly 

gly ser 

his 

thr 

tyr 

val 

trp 

gin 

val 

leu 

lys 


glu 

asn 

gly pro 

met 

ala 

ser 

asp 

pro 

leu 

cys 

leu 

thr 

30 


ser 

■^r leu 

ser 

his 

val 

asp 

leu 

val 

lys 

asp 

leu 


asn 

ser 

gly leu 

ile 

gly 

ala 

leu 

leu 

val 

cys 

arg 

glu 


gly 

ser 

leu ala 

lys 

glu 

lys 

thr 

gin 

thr 

leu 

his 

lys 


phe 

ile 

leu leu 

phe 

ala 

val 

phe 

asp 

glu 

gly 

lys 

ser 


trp 

his 

ser glu 

thr 

lys 

asn 

ser 

leu 

met 

gin 

asp 

arg 

35 

asp 

ala 

ala ser 

ala 

arg 

ala 

trp 

pro 

lys 

met 

his 

thr 


val 

asn 

gly try 

val 

asn 

arg 

ser 

leu 

pro 

gly 

leu 

ile 
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gly 

cys 

his arg 

lys 

ser 

val tyr 

trp 

his 

val 

ile 

gly 


met 

gly 

thr thr 

pro 

glu 

val his 

ser 

ile 

phe 

leu 

glu 


gly 

his 

thr phe 

leu 

val 

arg asn 

his 

arg 

gin 

ala 

ser 


leu 

glu 

ile ser 

pro 

ile 

thr phe 

leu 

thr 

ala 

gin 

thr 

5 

leu 

leu 

met asp 

leu 

gXy 

gin phe 

leu 

leu 

phe 

cys 

his 


ile 

ser 

ser his 

gin 

his 

asp gly 

met 

glu 

ala 

tyr 

val 


lys 

val 

asp ser 

cys 

pro 

glu glu 

pro 

gin 

leu 

arg 

met 


lys 

asn 

asn glu 

glu 

ala 

glu asp 

tyr 

asp 

asp 

asp 

leu 


thr 

asp 

ser glu 

met 

asp 

val val 

arg 

phe 

asp 

asp 

asp 

10 

asn 

ser 

pro ser 

phe 

ile 

gin ile 

arg 

ser 

val 

ala 

lys 


lys 

his 

pro lys 

thr 

trp 

val his 

tyr 

ile 

ala 

ala 

glu 


glu 

glu 

asp trp 

asp 

tyr 

ala pro 

leu 

val 

leu 

ala 

pro 


asp 

asp 

arg ser 

tyr 

lys 

ser gin 

tyr 

leu 

asn 

asn 

gly 


pro 

gin 

arg ile 

gly 

arg 

lys tyr 

lys 

lys 

val 

arg 

phe 

15 

met: 

ala 

tyr thr 

asp 

glu 

thr phe 

lys 

thr 

arg 

glu 

ala 


ile 

gin 

his glu 

ser 

gly 

ile leu 

gly 

pro 

leu 

leu 

tyr 


gly 

glu 

val gly 

asp 

thr 

leu leu 

ile 

ile 

phe 

lys 

asn 


gin 

ala 

ser arg 

pro 

igrr 

asn ile 

tyr 

pro 

his 

gly 

ile 


thr 

asp 

val arg 

pro 

leu 

tyr ser 

arg 

arg 

leu 

pro 

lys 

20 

gly 

^ral 

lys his 

leu 

lys 

asp phe 

pro 

ile 

leu 

pro 

gly 


glu 

ile 

phe lys 

tyr 

lys 

trp thr 

val 

thr 

val 

glu 

asp 


gly 

pro 

thr lys 

ser 

asp 

pro arg 

cys 

leu 

thr 

arg 

tyr 


tyr 

ser 

ser phe 

val 

asn 

met glu 

arg 

asp 

leu 

ala 

ser 


gly 

leu 

ile gly 

pro 

leu 

leu ile 

cys 

tyr 

lys 

glu 

ser 

25 

val 

asp 

gin arg 

gly 

asn 

gin ile 

met 

ser 

asp 

lys 

arg 


asa 

val 

ile leu 

phe 

ser 

val phe 

asp 

glu 

asn 

arg 

ser 


trp 

tyr 

leu thr 

glu 

asn 

ile gin 

arg 

phe 

leu 

pro 

asn 


pro 

alai 

gly val 

gin 

leu 

glu asp 

pro 

glu 

phe 

gin 

ala 


ser 

asn 

ile met 

his 

ser 

ile asn 

gly 

tyr 

val 

phe 

asp 

30 

ser 

leu 

gin leu 

ser 

val 

cys leu 

his 

glu 

val 

ala 

tyr 


trp 

tyr 

ile leu 

ser 

ile 

gly ala 

gin 

thr 

asp 

phe 

leu 


ser 

val 

phe phe 

ser 

gly 

tyr thr 

phe 

lys 

his 

lys 

met 


val 

tjr 

glu asp 

thr 

leu 

thr leu 

phe 

pro 

phe 

ser 

gly 


glu 

thr 

val phe 

met 

ser 

met glu 

asn 

pro 

gly 

leu 

trp 

35 

ile 

leu 

gly cys 

his 

asn 

ser asp 

phe 

arg 

asn 

arg 

gly 


met 

thr 

ala leu 

leu 

lys 

val ser 

ser 

cys 

asp 

lys 

asn 


thr 

gly 

asp tyr 

tyr 

glu 

cisp ser 

tyr 

glu 

asp 

ile 

ser 
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ala leu 
ser phe ser 
gly thr gin 
lys ser pro 
5 ile leu ser 
ala ala ile 
val thr trp 
ser gin asn 
ile thr arg 

10 asp tyr asp 
asp phe asp 
arg ser phe 
ala val glu 
pro his val 

15 pro gin phe 
gly ser phe 
glu his leu 
val glu asp 
ser arg pro 

20 glu glu asp 
phe val lys 
val gin his 
cys lys ala 
lys asp val 

25 cys his thr 
val thr val 
ile phe asp 
met glu arg 
glu asp pro 

30 ile asn gly 
met ala gin 
met gly ser 
gly his val 
met ala leu 

35 val glu met 
glu cys leu 
thr leu phe 
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leu ser lys asn asn 
gin asp pro leu ala 
ile pro lys glu glu 
glu lys thr ala phe 
leu asn ala cys glu 
asn glu gly gin asn 
ala lys gin gly arg 
pro pro val leu lys 
thr thr leu gin ser 
asp thr ile ser val 
ile tyr asp glu asp 
gin lys lys thr arg 
arg leu trp asp tyr 
leu arg asn arg ala 
lys lys val val phe 
thr gin pro leu tyr 
gly leu leu gly pro 
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ile leu leu phe ala val phe 
ids ser glu thr lys asn ser 
5 ala ala ser ala arg ala trp 
asn gly try val asn arg ser 
cys his arg lys ser val tyr 
gly thr thr pro glu val his 
his thr phe leu val arg asn 

10 glu ile ser pro ile thr phe 
leu met asp leu gly gin phe 
ser ser his gin liis asp gly 
val cisp ser cys pro glu glu 
asn asn glu glu ala glu asp 

15 asp ser glu met asp val val 
ser pro ser phe ile gin ile 
his pro lys thr trp val his 
glu asp trp asp tyr ala pro 
asp arg ser tyr lys ser gin 

20 gin arg ile gly arg lys tyr 
ala lyr thr asp glu thr phe 
gin his glu ser gly ile leu 
glu val gly asp thr leu leu 
ala ser arg pro tyr asn ile 

25 asp val arg pro leu tyr ser 
val lys his leu lys asp phe 
ile phe lys tyr lys trp thr 
pro thr lys ser asp pro arg 
ser ser phe val asn met glu 

30 leu ile gly pro leu leu ile 
asp gin arg gly asn gin ile 
val ile leu phe ser val phe 
tyr leu thr glu asn ile gin 
ala gly val gin leu glu asp 

35 asn ile met his ser ile asn 
leu gin leu ser val cys leu 
tyr ile leu ser ile gly ala 
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val phe phe ser gly tyr thr phe lys his lys met val 
tyr glu asp thr leu thr leu phe pro phe ser gly glu 
thr val phe met ser met glu asn pro gly leu trp ile 
leu gly cys his asn ser asp phe arg asu arg gly met 
5 thr ala leu leu lys val ser ser aj^ asp lys asn thr 
gly asp tyr tyr glu asp ser tyr glu asp ile ser ala 
tyr leu leu ser lys asn asn ala ile glu pro arg ser 
phe ser gin asp pro leu ala trp asp asn his tyr gly 
thr gin ile pro lys glu glu trp lys ser gin glu lys 

10 ser pro glu lys thr ala phe lys lys lys asp thr ile 
leu ser leu asn ala cys glu ser asn his ala ile ala 
ala ile asn glu gly gin asn lys pro glu ile glu val 
thr trp ala lys gin gly arg thr glu arg leu cys ser 
gin asn pro pro val leu lys arg his gin arg glu ile 

15 thr arg thr thr leu gin ser asp gin glu glu ile asp 
tyr asp asp thr ile ser val glu met lys lys glu asp 
phe asp ile. tyr asp glu asp glu asn gin ser pro arg 
ser phe gin lys lys thr arg his tyr phe ile ala ala* 
val glu arg leu trp asp tyr gly met ser ser ser pro 

20 his val leu arg asn arg ala gin ser gly ser val pro 
gin phe lys lys val val phe gin glu phe thr asp gly 
ser phe thr gin pro leu 1^ arg gly glu leu asn glu 
his leu gly leu leu gly pro tyr ile arg ala glu val 
glu asp asn ile met val thr phe arg asn gin ala ser 

25 arg pro tyr ser phe tyr ser ser leu ile ser tyr glu 
glu asp gin arg gin gly ala glu pro arg lys asn phe 
val lys pro asn glu thr lys thr tyr phe trp lys val 
gin his his met ala pro thr lys asp glu phe asp cys 
lys ala trp ala tyr phe ser asp val asp leu glu lys 

30 asp val his ser gly leu ile gly pro leu leu val cys 
his thr asn thr leu asn pro ala his gly arg gin val 
thr val gin glu phe ala leu phe phe(leu) thr ile 
phe asp glu thr lys ser trp tyr phe thr glu asn met 
glu arg asn cys arg ala pro cys asn ile gin met glu 

35 asp pro thr phe lys glu asn tyr arg phe his ala ile 
asn gly 1^ ile met asp thr leu pro gly leu val met 
ala gin asp gin arg ile arg trp tyr leu leu ser met 
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gly ser asn glu asn ile his ser ile his phe ser gly 
his val phe thr val arg lys lys glu glu tyr lys met 
ala leu tyr asn leu tyr pro gly val phe glu thr val 
glu met leu pro ser lys ala gly ile trp arg val glu 
5 cys leu ile gly glu his leu his ala gly met ser thr 
leu phe leu val tyr ser asn lys cys gin thr pro leu 
gly met ala ser gly his ile arg asp phe gin ile thr 
ala ser gly gin tyr gly gin trp ala pro lys leu ala 
arg leu his tyr ser gly ser ile asn ala trp ser thr 

10 lys glu pro phe ser trp ile lys val asp leu leu ala 
pro met ile ile his gly ile lys thr gin gly ala arg 
gin lys phe ser ser leu tyr ile ser gin phe ile ile 
met ser leu asp gly lys lys trp gin thr arg 
gly asn ser thr gly thr leu met val phe phe gly asn 

15 val asp ser ser gly ile lys his asn ile phe asn pro 
pro ile ile ala arg 1^ ile arg leu his pro thr his 
tyr ser ile arg ser thr leu arg met glu leu met gly 
cys asp leu asn ser cys ser met pro leu gly met glu 
ser lys ala ile ser asp ala gin ile thr ala ser ser 

20 tyr phe thr asn met phe ala thr trp ser pro ser lys 
ala arg leu his leu gin gly arg ser asn ala trp arg 
pro gin val asn asn pro lys glu trp leu gin val asp 
phe gin lys thr met lys val thr gly val thr thr gin 
gly val lys ser leu leu thr ser met tyr val lys glu 

25 phe leu ile ser ser ser gin asp gly his gin trp thr 
leu phe phe gin asn gly lys val lys val phe gin gly 
asn gin asp ser phe thr pro val val asn ser leu asp 
pro pro leu leu thr arg tyr leu arg ile his pro gin 
ser trp val his gin ile ala leu arg met glu val leu 

30 gly cys glu ala gin asp leu 1^; 

met ala thr arg arg tyr tjr leu gly ala val glu leu 
ser trp asp tyr met gin ser asp leu gly glu leu pro 
val asp ala arg phe pro pro arg val pro lys ser phe 
pro phe asn thr ser val val tyr lys lys tlir leu phe 

35 val glu phe thr asp his leu phe asn ile ala lys pro 
arg pro pro trp met gly leu leu gly pro thr ile gin 
ala glu val tyr asp thr val val ile thr leu lys asn 
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?• A modified factor VIII:C-like polypep- 
30 tide, comprising the N- terminal heavy chain of mature 
factor VIII :C linked directly to the C- terminal light 
chain of mature factor VIII :C, said polypeptide being 
essentially free of other serum proteins. 


8. A process for producing a polypeptide, 
35 coii5>rising the step of proteolytically cleaving the 
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modified factor VIII:C-like polypeptide of claim 7 
into the N-terminal heavy chain of mature factor 
VIII:C and the C-terminal light chain of mature 
factor VIII :C- 

5 9. The process according to claim 8,. 

further comprising the step of linking together by 
an alkaline metal bridge, the N-terminal heavy chain 
of mature factor VIII :C and the C-terminal heavy 
chain of mature factor VIII rC* 

10 10. A process for producing a modified 

factor VIII:C-like polypeptide comprising the step 
of culturing a host transformed with a recombinant 
DNA molecule as defined in claims 1 through 5. 

11. The process according to any of 
15 claims 8, 9 or 10, wherein the host is selected 

from BMTIO, BSCl, BSC40, COSl, COST, CBO cells and 
other animal and human cells in culture. 

12. A pharmaceutical composition comprising 
a polypeptide, produced according to the process of 

20 any of claims 8, 9 or 10, in an amount effective as 

a coagulant and a pharmaceutically acceptable carrier. 

13. A pharmaceutical coniposition co2q)rising 
a modified factor VIII:C-like polypeptide as defined 
in any one of claims 6-9 in an amount effective as a 

25 coagulant and a pharmaceutically acceptable carrier. 

14. A method for treating haemophilia 
comprising the step of treating a,^ human with the 
pharmaceutical coH^osition as defined in claims 12 
and 13. 
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15 • A recombinant DNA molecule according 
to claim 4, selected from the group consisting of 
the recombinant DNA molecules contained in transformed 
host E^coli HBIOI(RE), E,coli (HBIOI(RD) or E.coli 
5 HBIOI(RSD). 

16. A modified factor VIII:C-like polypep- 
tide produced by a host transformed with a recombinant 
DNA molecule selected from a group consisting of 
recombinant DNA molecules contained in transformed 
10 host E^coli HBIOI(RE), E.coli HBIOI(RD) and E.coli 
HBIOI(RSD). 


17. A process for producing a modified 
factor VIII:C-lilce polypeptide coii5)rising the step 
of culturing a host transformed with a recombinant 
15 DNA molecule selected from a group consisting of 
recombinant DNA molecules contained in transformed 
host E.coli HBIOI(RE), E.coli HBIOI(RD) and E.coli 
HBIOI(RSD). 


18. A pharmaceutical coiq)osition comprising 
20 a polypeptide produced according to the process of 

claim 17 in an amount effective as a coagulant and a 
pharmaceutically acceptable carrier. 

19. A method for treating haemophilia 
comprising the step of treating a human with a phar- 
maceutical con^osition according to claim 18. 
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1 10 
ala thr arg arg tyr tyr leu giy ala val glu leu ser 
GCC ACC AGA AGA TAG TAG CTG GGT. GCA GTG GAA CTG TCA 


20 

trp asp tyr met gin ser asp leu gly glu leu pro val 
TGG GAG TAT ATG GAA ACT GAT CTG GGT GAG CTG CCT GTG 


30 

asp ala arg phe pro pro arg val pro lys ser phe pro 
GAG GCA AGA TTT GGT GGT AGA GTG GCA AAA TCT TTT CCA 

40 50 
phe asn thr ser val val tyr lys lys thr leu phe val 
TTC AAC ACC TCA GTC GTG TAG AAA AAG ACT CTG TTT GTA 


ecoRZ 60 

glu phe thr asp his leu phe asn lie ala lys pro arg 
GAA TTC ACG GAT GAG CTT TTC AAG ATC GCT AAG CCA AGG 


70 

pro pro trp met gly leu leu gly pro thr lie gin ala 
CCA GCC TGG ATG GGT CTG GTA GGT CCT ACC ATC GAG GCT 


80 90 
glu val tyr asp thr val val lie thr leu lys asn met 
GAG GTT TAT GAT ACA GTG GTC ATT ACA CTT AAG AAC ATG 


100 

ala ser his pro val ser leu his ala val gly val ser 
GCT TCC CAT CCT GTC ACT CTT CAT GCT GTT GGT GTA TGG 


FIG. 7 
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hindlll 110 
tyr trp lys ala ser glu gly ala glu tyr asp asp gin 
TAC TGG AAA -GCT TCT GAG GGA GCT GAA TAT GAT GAT CAG 

120 130 
thr ser gin arg glu lys glu asp asp. lys.vaL -pite:=pro 
ACC ACT CAA AGG GAG AAA GAA GAT C»X: AAA . GTC TTC: CCT 

"Z " W. .- ,4 

gly gly ser his thr tyr val trp gin. val.- len- lys:-.gla 
GGT GGA AGC CAT ACA TAT GTC TGG CAC Glir CEK AfiffvGAG 

150 

asn gly pro met ala ser asp pro leu cys leu thr tyr 
AAT GCT CCA ATG GCC TCT GAC CCA CTG TGC CTT ACC TAC 

160 ecoRI 
ser tyr leu ser his val asp leu val lys asp leu asn 
TCA TAT CTT TCT CAT CTG GAC CTG CTA AAA GAC TTG AAT 

170 180 

ser gly leu ile gly ala leu leu val cys arg glu gly 

TCA GGC CTC ATT GGA GCC CTA CTA GTA TCT AGA GAA GGG 

190 

ser leu ala lys glu lys thr gin thr leu his lys phe 
ACT CTG GCC AAG GAA AAG ACA CAC ACC TTG CAC AAA TTT 

200 

ile leu leu phe ala val phe asp glu gly lys ser trp 
ATA CTA CTT TTT GCT CTA TTT GAT GAA GGG AAA ACT TGG 

210 220 
his ser glu thr lys asn ser leu met gin asp arg asp 
CAC TCA GAA ACA AAG AAC TCC TTG ATG CAG GAT AGG GAT 


F I G. 7(cont'd} 
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230 

ala ala ser ala arg ala trp pro lys met his thr val 
GCT GCA TCT GCT CGG GCC TGG CCT AAA ATG CAC ACA GTC 


240 

asn gly try val asn arg ser leu pro gly leu ile gly 
AAT GGT TAT GTA AAC AGG TCT CTG CCA GGT CTG ATT GGA 


250 260 
cys his arg lys ser val tyr trp his val ile gly met 
TGC CAC AGG AAA TCA GTC TAT TGG CAT GTG ATT GGA ATG 

270 

gly thr thr pro glu val his ser ile phe leu glu gly 
GGC ACC ACT CCT GAA GTG CAC TCA ATA TTC CTC GAA GGT 


280 

his thr phe leu val arg asn 
CAC ACA TTT CTT GTG AGG AAC 

290 

glu ile ser pro ile thr phe 
GAA ATC TCG' CCA ATA ACT TTC 


his arg gin ala ser leu 
CAT CGC CAG GCG TCC TTG 


leu thr ala gin thr leu 
CTT ACT GCT CAA ACA CTC 


300 310 

leu met eisp leu gly gin phe leu leu phe cys his ile 

TTG ATG GAC CTT GGA CAG TTT CTA CTG TTT TCT CAT ATC 


320 hindlll 

ser ser his gin his asp gly met glu ala tyr val lys 
TCT TCC CAC CAA CAT GAT GGC ATG GAA GCT TAT CTC AAA 


330 

val asp ser cys pro glu glu pro gin leu arg met lys 
CTA GAC AGC TCT CCA GAG GAA CCC CAA CTA CGA ATG AAA 
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340 350 
asn asn glu glu ala glu asp tyr asp asp asp leu thr 
AAT AAT GAA GAA GCG GAA GAC TAT CAT GAT GAT CTT ACT 


360 

asp ser glu met asp val val arg phe asp asp asp asn 
GAT TCT GAA ATG GAT GTG GTC AGG TTT GAT GAT GAC AAC 


370 

ser pro ser phe ile gin ile 
TCT CCT TCC TTT ATC CAA ATT 

380 

his pro lys thr trp val his 
CAT CCT AAA ACT TGG GTA CAT 


arg ser val ala lys lys 
CGC TCA GTT GCC AAG AAG 

390 

tyr ile ala ala glu glu 
TAC ATT GCT GCT GAA GAG 


400 

glu asp trp asp tyr ala pro leu val leu ala pro asp 
GAG GAC TGG GAC TAT GCT CCC TTA GTC CTC GCC CCC GAT 


410 

asp arg ser tyr lys ser gin tyr leu asn asn gly pro 
GAC AGA ACT TAT AAA AGT CAA TAT TTG AAC AAT GGC CCT 


420 

gin arg ile gly arg lys tyr lys lys val arg phe met 
CAG CGG ATT GGT AGG AAG TAC AAA AAA CTC CGA TTT ATG 


430 440 

ala tyr thr asp glu thr phe lys thr arg glu ala ile 

GCA TAC ACA GAT GAA ACC TTT AAG ACT CCT GAA GCT ATT 

450 

gin his glu ser gly ile leu gly pro leu leu tyr gly 
CAG CAT GAA TCA GGA ATC TTG GGA CCT TTA CTT TAT GGG 
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460 

glu val gly asp thr leu leu ile ile phe lys asn gin 
GAA GTT GGA GAC ACA CTG TTG ATT ATA TTT AAG AAT CAA 

470 480 
ala ser arg pro tyr asn ile tyr pro his gly ile thr 
GCA AGC AGA CCA TAT AAC ATC TAG CCT CAC GGA ATC ACT 

490 

asp val arg pro leu tyr ser arg arg leu pro lys gly 
GAT GTC CGT CCT TTG TAT TCA AGG AGA TTA CCA AAA GGT 

500 

val lys his leu lys asp phe pro ile leu pro gly glu 
GTA AAA CAT TTG AAG GAT TTT CCA ATT CTG CCA GGA GAA 

510 520 
ile phe lys tyr lys trp thr val thr val glu asp gly 
ATA TTC AAA TAT AAA TGG ACA GTG ACT GTA GAA GAT GGG 

530 

pro thr lys ser asp pro arg cys leu thr arg tyr tyr 
CCA ACT AAA TCA GAT CCT CGG TGC CTG ACC CGC TAT TAC 

540 

ser ser phe val asn met glu arg asp leu ala ser gly 
TCT ACT TTC GTT AAT ATG GAG AGA GAT CTA GCT TCA GGA 

550 

leu ile gly pro leu leu ile cys tyr lys glu ser val 
CTC ATT GGC CCT CTC CTC ATC TGC TAC AAA GAA TCT GTA 

560- 570 

asp gin arg gly asn gin ile met ser asp lys arg asn 

GAT CAA AGA GGA AAC CAG ATA ATG TCA GAC AAG AGG AAT 
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580 k 
val ile leu phe ser val phe asp glu asn arg ser trp 
GTC ATC CTG TTT TCT GTA TTT GAT GAG AAC CGA AGC TGG 


pnl 590 

tyr leu thr glu asn ile gin arg phe leu pro asn pro 
TAG CTC ACA GAG AAT ATA CAA CGC TTT GTC CCC AAT CCA 


600 bamHI 610 

ala gly val gin leu glu asp pro glu phe gin ala ser 
GCT GGA GTG CAG CTT GAG GAT CCA GAG TTC CAA GCC TCC 


asn ild met his ser ile 
AAC ATC ATG CAC AGC ATC 

630 

leu gin leu ser val cys 
TTG CAG TTG TCA GTT TGT 


620 

asn gly tyr val phe asp ser 
AAT GGC TAT GTT TTT GAT AGT 

leu his glu val ala tyr trp 
TTG CAT GAG GTG GCA TAC TGG 


640 650 
tyr ile leu ser ile gly ala gin thr asp phe leu ser 
TAC ATT CTA AGC ATT GGA GCA CAG ACT GAC TTC CTT TCT 


660 

val phe phe ser gly tyr thr phe lys his lys met val 
GTC TTC TTC TCT GGA TAT ACC TTC AAA CAC AAA ATG GTC 


670 

tyr glu asp thr leu thr leu phe pro phe ser gly glu 
TAT GAA GAC ACA CTC ACC CTA TTC CCA TTC TCA GGA GAA 


680 _ 
thr val phe met ser met glu asn pro gly leu trp ile 
ACT GTC TTC ATG TCG ATG GAA AAC CCA GGT CTA TGG ATT 
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690 700 

leu gly cys his asn ser asp phe arg asn arg gly met 

CTG GGG TGC CAC AAC TCA GAC TTT JZCG AAC AGA GGC ATG 


710 

thr ala leu leu lys val ser ser cys asp lys asn thr 
ACC GCC TTA CTG AAG GTT TCT ACT TGT GAC AAG AAC ACT 


720 

gly asp tyr tyr glu asp ser tyr glu asp ile ser ala 
GGT GAT TAT TAG GAG GAC AGT TAT GAA GAT ATT TCA GCA 


730 

tyr leu leu ser lys asn asn 
TAC TTG CTG ACT AAA AAC AAT 

II ecoRI 

phe ser gin asn ser arg his 

TTC TCC CAG AAT TCA AGA CAC 

760 

gin phe asn ala thr thr ile 
CAA TTT AAT GCC ACC ACA ATT 


hindl 

ala ile glu pro arg ser 
GCC ATT GAA CCA AGA AGC 

750 

pro ser thr arg gin lys 
CCT AGC ACT AGG CAA AAG 


pro glu asn asp ile glu 
CCA GAA AAT GAC ATA GAG 


770 780 
lys thr asp pro trp phe ala his arg thr pro met pro 
AAG ACT GAC CCT TGG TTT GCA CAC AGA ACA CCT ATG CCT 


790 

lys ile gin asn val ser ser ser asp leu leu met leu 
AAA ATA CAA AAT CTC TCC TCT ACT GAT TTG TTG ATG CTC 


800 

leu arg gin ser pro thr pro his gly leu ser leu ser 
TTG CGA CAG ACT CCT ACT CCA CAT GGG CTA TCC TTA TCT 
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810 

asp leu gin glu ala lys tyr glu thr phe ser asp asp 
GAT CTC CAA GAA GCC AAA TAT GAG ACT TTT TCT GAT GAT 

820 830 

pro ser pro gly ala lie asp ser asn asn ser leu ser 

CCA TCA CCT GGA GCA ATA GAC AGT AAT AAC AGC CTG TCT 

840 

glu met thr his phe arg pro gin leu his his ser gly 
GAA ATG ACA CAC TTC AGO CCA CAG CTC CAT CAC AGT GGG 

850 

asp met val phe thr pro glu ser gly leu gin leu arg 
GAC ATG GTA TTT ACC CCT GAG TCA GGC CTC CAA TTA AGA 

860 870 
leu asn glu lys leu gly thr thr ala ala thr glu leu 
TTA AAT GAG AAA CTG GGG ACA ACT GCA GCA ACA GAG TTG 

880 

lys lys leu asp phe lys val ser ser thr ser asn asn 
AAG AAA CTT GAT TTC AAA GTT TCT AGT ACA TCA AAT AAT 

890 

leu ile ser thr ile pro ser asp asn leu ala ala gly 
CTG ATT TCA ACA ATT CCA TCA GAC AAT TTG GCA GCA GOT 

900 910 
thr asp asn thr ser ser leu gly pro pro ser met pro 
ACT GAT AAT ACA AGT TCC TTA GGA CGC CCA AGT ATG CCA 

920 

val his tyr asp ser gin leu asp thr thr leu phe gly 
GTT CAT TAT GAT AGT CAA TTA GAT ACC ACT CTA TTT GGC 
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930 

lys lys ser ser pro leu thr glu ser gly gly pro leu 
AAA AAG TCA TCT CCC CTT ACT GAG TCT GGT GGA CCT CTG 

940 

ser leu ser glu glu asn asn asp ser lys lea. Leu glu 
AGC TTG ACT GAA GAA AAT AAT GAT TCA A&C^ TTG". .TIA GA& . 

950 9m 

ser gly leu met asn ser gin glu se^./sejr:trp:..gpL7 Lys:. 

TCA GGT TTA ATG AAT AGC CAA GAA A(IL TC&iTCC^'GXSA AAA 

970 

asn val ser ser thr glu ser gly arg leu phe lys gly 
AAT GTA TCG TCA ACA GAG AGT GGT AGG TTA TTT AAA GGG 


sad 980 
lys arg ala his gly pro ala 
AAA AGA GCT CAT GGA CCT GCt 

990 

ala leu phe lys val ser ile 
GCC TTA TTC AAA GTT AGC ATC 


leu leu thr lys asp asn 
TTG TTG ACT AAA GAT AAT 

1000 

ser leu leu lys thr asn 
TCT TTG TTA AAG ACA AAC 


1010 

lys thr ser asn asn ser ala thr asn arg lys thr his 
AAA ACT TCC AAT AAT TCA GCA ACT AAT AGA AAG ACT CAC 

1020 

ile asp gly pro ser leu leu ile glu asn ser pro ser 
ATT GAT GGC CCA TCA TTA TTA ATT GAG AAT AGT CCA TCA 


1030 1040 
val trp gin asn ile leu glu ser asp thr glu phe lys 
GTC TGG CAA AAT ATA TTA GAA AGT GAC ACT GAG TTT AAA 
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1050 

lys val thr pro leu ile his asp arg met leu met asp 
AAA GTG ACA CCT TTG ATT CAT GAC AGA ATG CTT ATG GAC 

1060 

lys asn ala thr ala leu arg leu asn his met ser asn 
AAA AAT GCT ACA OCT TTG AGG CTA AAT CAT ATG TCA AAT 

1070 

lys thr thr ser ser lys asn met glu met val gin gin 
AAA ACT ACT TCA TCA AAA AAC ATG GAA ATG GTC CAA CAG 

1080 1090 

lys lys glu gly pro ile pro pro asp ala gin asn pro 

AAA AAA GAG GGC CCC ATT CCA CCA GAT GCA CAA AAT CCA 

1100 

asp met ser phe phe lys met leu 
GAT ATG TCG TTC TTT AAG ATG CTA 

1110 

ala arg trp ile gin arg thr his 
GCA AGG TGG ATA CAA AGG ACT CAT 

1120 

asn ser gly gin gly pro ser pro 
AAC TCT GGG CAA GGC CCC ACT CCA 


leu gly pro glu lys ser val glu 
TTA GGA CCA GAA AAA TCT GTG GAA 

1150 

ser glu lys asn lys val val val 
TCT GAG AAA AAC AAA CTG GTA CTA 


phe leu pro glu ser 
TTC TTG CCA GAA TCA 


gly lys asn ser leu 
GGA AAG AAC TCT CTG 

1130 

lys gin leu val ser 
AAG CAA TTA CTA TCC 

1140 

gly gin asn phe leu 
GCT CAG AAT TTC TTG 


gly lys gly glu phe 
GGA AAG GCT GAA TTT 
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H60 


1170 


thr lys asp val gly leu lys glu met val phe pro ser 
ACA AAG GAC GTA GGA CTC AAA GAG ATG GTT TTT CCA AGC 

1180 

ser arg asn leu phe leu thr asn leu asp asn leu his 
AGO AGA AAC CTA TTT CTT ACT AAC TTG GAT AAT TTA CAT 

1190 

glu asn asn thr his asn gin glu lys lys ile gin glu 
GAA AAT AAT ACA CAC AAT CAA GAA AAA AAA ATT CAG GAA 

1200 

glu ile glu lys lys glu thr leu ile gin glu asn val 
GAA ATA GAA AAG AAG GAA ACA TTA ATC CAA GAG AAT GTA 

1210 1220 

val leu pro gin ile his thr val thr gly thr lys asn 

GTT TTG CCT CAG ATA CAT ACA GTG ACT GGC ACT AAG AAT 

1230 

phe met lys asn leu phe leu leu ser thr arg gin asn 
TTC ATG AAG AAC CTT TTC TTA CTG AGC ACT AGG CAA AAT 


val glu gly ser tyr asp gly ala tyr ala pro val leu 
GTA GAA GGT TCA TAT GAC GGG GCA TAT GCT CCA GTA CTT 


gin asp phe arg ser leu asn asp ser thr asn arg thr 
CAA GAT TTT AGG TCA TTA AAT GAT TCA ACA AAT AGA ACA 

- 1270 
lys lys his thr ala his phe ser lys lys gly glu glu 
AAG AAA CAC ACA GCT CAT TTC TCA AAA AAA GGG GAG GAA 


1240 


seal 


1250 


1260 
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1280 

glu asn leu glu gly leu gly asn gin thr lys gin ile 
GAA AAC TTG GAA GGC TTG GGA AAT CAA ACC AAG CAA ATT 

1290 sphi 1300 
val glu lys tyr ala cys thr thr arg ile ser pro asn 
GTA GAG AAA TAT GCA TGC ACC ACA AGG ATA TCT CCT AAT 

1310 

thr ser gin gin asn phe val thr gin arg ser lys arg 
ACA AGC GAG CAG AAT TTT GTC ACG CAA CGT AGT AAG AGA 

1320 

ala leu lys gin phe arg leu pro leu glu glu thr glu 
GCT TTG AAA CAA TTC AGA CTC CCA CTA GAA GAA ACA GAA 

1330 

leu glu lys arg ile ile val asp asp thr ser thr gin 
CTT GAA AAA AGG ATA ATT GTG GAT GAC ACC TCA ACC CAG 

1340 1350 

trp ser lys asn met lys his leu thr pro ser thr leu 

TGG TCC AAA AAC ATG AAA CAT TTG ACC CCG AGC ACC CTC 

1360 

thr gin ile asp tyr asn glu lys glu lys gly ala ile 
ACA CAG ATA GAC TAC AAT GAG AAG GAG AAA GGG GCC ATT 

1370 

thr gin ser pro leu ser asp cys leu thr arg ser his 
ACT CAG TCT CCC TTA TCA GAT TCG CTT ACG AGG AGT CAT 

1380 1390 
ser ile pro gin ala asn arg ser pro leu pro ile ala 
AGC ATC CCT CAA GCA AAT AGA TCT CCA TTA CCC ATT GCA 
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lys val ser ser phe pro ser ile arg pro ile tyr leu 
AAG GTA TCA TCA TTT CCA TCT ATT AGA CCT ATA TAT CTG 

1410 

thr arg val leu phe gin asp asn ser ser his leu pro 
ACC AGO GTC CTA TTC CAA GAC AAC TCT TCT CAT CTT CCA 

1420 1430 
ala ala ser tyr arg lys lys asp ser gly val gin glu 
GCA GCA TCT TAT AGA AAG AAA GAT TCT GGG GTC CAA GAA 

1440 

ser ser his phe leu gin gly ala lys lys asn asn leu 
AGC AGT CAT TTC TTA CAA GGA GCC AAA AAA AAT AAC CTT 

1450 

ser leu ala ile leu thr leu glu met thr gly asp gin 
TCT TTA GCC ATT CTA ACC TTG GAG ATG ACT GGT GAT CAA 

1460 

arg glu val gly ser leu gly thr ser ala thr asn ser 
AGA GAG GTT GGC TCC CTG GGG ACA AGT GCC ACA AAT TCA 

1470 1480 

val thr tyr lys lys val glu asn thr val leu pro lys 

GTC ACA TAC AAG AAA GTT GAG AAC ACT GTT CTC CCG AAA 

1490 

pro asp leu pro lys thr ser gly lys val glu leu leu 
CCA GAC TTG CCC AAA ACA TCT GGC AAA GTT GAA TTG CTT 

1500 

pro lys val his ile tyr gin lys asp leu phe pro thr 
CCA AAA GTT CAC ATT TAT CAG AAG GAC CTA TTC CCT ACG 
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1510 1520 
glu thr ser asn gly ser pro gly his leu asp leu val 
GAA ACT AGC AAT GGG TCT CCT GGC CAT CTG GAT CTC GTG 

1530 

glu gly ser leu leu gin gly thr glu gly ala ile lys 
GAA GGG AGC CTT CTT CAG GGA ACA GAG GGA GCG ATT AAG 

1540 

trp asn glu ala asn arg pro gly lys val pro phe leu 
TGG AAT GAA GCA AAC AGA CCT GGA AAA CTT CCC TTT CTG 

1550 1560 
arg val ala thr glu ser ser ala lys thr pro ser lys 
AGA CTA GCA ACA GAA AGC TCT GCA AAG ACT CCC TCC AAG 

bainHI 1570 
leu leu asp pro leu ala trp asp asn his tyr gly thr 
CTA TTG GAT CCT CTT GCT TGG GAT AAC CAC TAT GCT ACT 

1580 

gin ile pro lys glu glu trp lys ser gin glu lys ser 
CAG ATA CCA AAA GAA GAG TGG AAA TCC CAA GAG AAG TCA 

1590 

pro glu lys thr ala phe lys lys lys asp thr ile leu 
CCA GAA AAA ACA GCT TTT AAG AAA AAG GAT ACC ATT TTG 

1600 1610 

ser leu asn ala cys glu ser asn his ala ile ala ala 

TCC CTG AAC GCT TCT GAA AGC AAT CAT GCA ATA GCA GCA 

1620 

ile asn glu gly gin asn lys pro glu ile glu val thr 
ATA AAT GAG GGA CAA AAT AAG CCC GAA ATA GAA CTC ACC 
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1630 

trp ala lys gin gly arg thr glu arg leu cys ser gin 
TGG GCA AAG CAA GGT AGG ACT GAA AGG CTG TGC TCT CAA 

1640 1650 
asn pro pro val leu lys arg his gin arg glu ile thr 
AAC CCA CCA GTC TTG AAA CGC CAT CAA CGG GAA ATA ACT 

1660 

arg thr thr leu gin ser asp gin glu glu ile asp tyr 
CGT ACT ACT CTT CAG TCA GAT CAA GAG GAA ATT GAC TAT 

1670 

asp asp thr ile ser val glu met lys lys glu asp phe 
GAT GAT ACC ATA TCA GTT GAA ATG AAG AAG GAA GAT TTT 

1680 1690 
asp ile tyr asp glu asp glu asn gin ser pro arg ser 
GAC ATT TAT GAT GAG GAT GAA AAT CAG AGC CCC CGC AGC 

1700 

phe gin lys lys thr arg his tyr phe ile ala ala val 
TTT CAA AAG AAA ACA CGA CAC TAT TTT ATT GCT GCA GTG 

1710 

glu arg leu trp asp tyr gly met ser ser ser pro his 
GAG AGG CTC TGG GAT TAT GGG ATG AGT AGC TCC CCA CAT 

1720 

val leu arg asn arg ala gin ser gly ser val pro gin 
GTT CTA AGA AAC AGG GCT CAG AGT GGC AGT GTC CCT CAG 

1730 - 1740 

phe lys lys val val phe gin glu phe thr asp gly ser 
TTC AAG AAA GTT GTT TTC CAG GAA TTT ACT GAT GGC TCC 
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phe thr gin pro leu tyr 
TTT ACT CAG CCC TTA TAC 


1750 

arg gly glu leu asn glu his 

CGT GGA GAA CTA AAT GAA CAT 


1760 

leu gly leu leu gly pro tyr ile arg ala glu val glu 
TTG GGA CTC CTG GGG CCA TAT ATA AGA GCA GAA GTT GAA 


1770 1780 
asp asn ile met val thr phe arg asn gin ala ser arg 
GAT AAT ATC ATG GTA ACT TTC AGA AAT CAG GCC TCT CGT 


pro tyr ser phe tyr ser ser 
CCC TAT TCC TTC TAT TCT AGC 

1800 

asp gin arg gin gly ala glu 
GAT CAG AGG CAA GGA GCA GAA 

1810 

lys pro asn glu thr lys thr 
AAG CCT AAT GAA ACC AAA ACT 


1790 

leu ile ser tyr glu glu 
CTT ATT TCT TAT GAG GAA 


pro arg lys asn phe val 
CCT AGA AAA AAC TTT GTC 

1820 

tyr phe trp lys val gin 
TAC TTT TGG AAA GTG CAA 


1830 

his his met ala pro thr lys asp glu phe asp cys lys 
CAT CAT ATG GCA CCC ACT AAA GAT GAG TTT GAC TGC AAA 


1840 

ala trp ala tyr phe ser asp val asp leu glu lys asp 
GCC TGG GCT TAT TTC TCT GAT GTT GAC CTG GAA AAA GAT 


1850 

val his ser gly leu ile gly pro leu leu val cys his 
GTG CAC TCA GGC CTG ATT GGA CCC CTT CTG GTC TGC CAC 
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1860 

thr asn thr leu asn pro ala 
ACT AAC ACA CTG AAC CCT GCT 


val gin glu phe ala leu phe 
GTA CAG GAA TTT GCT CTG TTT 

1890 

thr lys ser trp tyr phe thr 
ACC AAA AGC TGG TAG TTC ACT 

1900 

cys arg ala pro cys asn ile 
TGC AGG GCT CCC TGC AAT ATC 


phe lys glu asn tyr arg phe 
TTT AAA GAG AAT TAT CGC TTC 

1930 

ile met asp thr leu pro gly 
ATA ATG GAT ACA CTA CCT GGC 

1940 

gin arg ile arg trp tyr leu 
CAA AGG ATT CGA TGG TAT CTG 


glu asn ile his ser ile his 
GAA AAC ATC CAT TCT ATT CAT 

1970 

thr val arg lys lys glu glu 
ACT GTA CGA AAA AAA GAG GAG 


1870 

his gly arg gin val thr 
CAT GGG AGA CAA GTG ACA 

1880 

phe thr ile phe asp glu 
TTC ACC ATC TTT GAT GAG 


glu asn met glu arg asn 
GAA AAT ATG GAA AGA AAC 

1910 

gin met glu asp pro thr 
CAG ATG GAA GAT CCC ACT 

1920 

his ala ile asn gly tyr 
CAT GCA ATC AAT GGC TAG 


leu val met ala gin asp 
TTA GTA ATG GCT CAG GAT 

1950 

leu ser met gly ser asn 
CTC AGC ATG GGC AGC AAT 

1960 

phe ser gly his val phe 
TTC AGT GGA CAT GTG TTC 


tyr lys met ala leu tyr 
TAT AAA ATG GCA CTG TAG 
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1980 

^ leu tyr pro gly val phe !^ 

MI CIC TAT CCA GOT OIT TIT GAG XCA GIG GAA ATG ITA 

2000 

o^rser lys ala gly ile trp arg val glu cys leu ile 
Ccl TCC 1 OCT L ATT TGG CGG GTG GAA TGC CTT ATT 

2010 

gly glu ixis leu his ala gly met ser thr leu phe leu 
GGC GAG CAT CTA CAT GCT GGG ATG AGC ACA CTT TTT CTG 

2020 

val tyr ser asn lys cys 9ln thr pro leu gly nrt ala 
CIG iL AGC AAT AAG TGI CAG ACT CCC CIG GGA ATG GCT 

2040 

2030" , 
ser gly his ile arg asp phe gin ile thr ala ser gly 
^ CAC ATI AGA GAT TTT CAG ATT ACA GCT TCA GGA 

2050 

gin tyr gly gin txp ala pro lys leu ala arg leu his 
CAA TAT GGA CAG TGG GCC CCA AAG CTG GCC AGA CTT CAT 

2060 

tyr ser gly ser ile asn ala trp ser thr lys glu pro 
^ TCA ATC AAT GCC TGG AGC ACC AAG GAG CCC 

2080 

2070 ^ 
phe ser trp ile lys val asp leu leu ala pro met lie 
TCT tS ATC AAG GTG GAT CTG TTG GCA CCA ATG ATT 

2090 

ile his gly ile lys thr gin gly ala 91" 3^ ^ 

ATT CAC GGC AIC AAG ACC CAG GGI GCC CGI CAG AAG TTC 
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2100 

ser ser leu tyr ile ser gin phe ile ile met tyr ser 
TCC AGC CTC TAG ATC TCT CAG TTT AJC ATC ATG TAT AGT 

2110 

leu asp gly lys lys trp gin thr tyr arg gly asn ser 
CTT GAT GGG AAG AAG TGG CAG ACT TAT CGA GGA AAT TCC 

2120 2130 

thr gly thr leu met val phe phe gly asn val asp ser 

ACT GGA ACC TTA ATG GTC TTC TTT GGC AAT GTG GAT TCA 

2140 

ser gly ile lys his asn ile phe asn pro pro ile ile 
TCT GGG ATA AAA CAC AAT ATT TTT AAC CCT CCA ATT ATT 

• 2150 

ala arg tyr ile arg leu his pro thr his tyr ser ile 
GCT CGA TAC ATC CGT TTG CAC CCA ACT CAT TAT AGC ATT 

2170 

2160 ^-^'^ 
arg ser thr leu arg met glu leu met gly cys asp leu 
CGC AGC ACT CTT CGC ATG GAG TTG ATG GGC TGT GAT TTA 

sphi 2180 
asn ser cys ser met pro leu gly met glu ser lys ala 
AAT AGT TGC AGC ATG CCA TTG GGA ATG GAG AGT AAA GCA 

2190 

ile ser asp ala gin ile thr ala ser ser tyr phe thr 
ATA TCA GAT GCA CAG ATT ACT GCT TCA TCC TAC TTT ACC 

2210 

2200 

asn met phe ala thr trp ser pro ser lys ala arg leu 
AAT ATG TTT GCC ACC TGG TCT CCT TCA AAA GCT CGA CTT 
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2220 

his leu gin gly arg ser asn ala trp arg pro gin val 
CAC CTC CAA GGG AGG AGT AAT GCC TGG AGA CCT CAG GTG 

2230 

asn asn pro lys glu trp leu gin val asp phe gin lys 
AAT AAT CCA AAA GAG TGG CTG CAA GTG GAC TTC CAG AAG 

2240 

1-hr met lys val thr gly val thr thr gin gly val lys 
ACA ATG AAA GTC ACA GGA GTA ACT ACT CAG GGA GTA AAA 

2250 2260 

ser leu leu thr ser met tyr val lys glu phe leu ile 

TCT CTG CTT ACC AGC ATG TAT GTG AAG GAG TTC CTC ATC 

2270 

ser ser ser gin asp gly his gin trp thr leu phe phe 
TCC AGC ACT CAA GAT GGC CAT CAG TGG ACT CTC TTT TTT 

2280 

gin asn gly lys val lys val phe gin gly asn gin asp 
CAG AAT GGC AAA CTA AAG GTT TTT CAG GGA AAT CAA GAC 

2290 2300 
ser phe thr pro val val asn ser leu asp pro pro leu 
TCC TTC ACA CCT GTG GTG AAC TCT CTA GAC CCA CCG TTA 

ecoRI 2310 
leu thr arg tyr leu arg ile his pro gin ser trp val 
CTG ACT CGC TAC CTT CGA ATT CAC CCC CAG AGT TGG GTG 

2320 

his gin ile ala leu arg met glu val leu gly cys glu 
CAC CAG ATT GCC CTG AGG ATG GAG GTT CTG GGC TGC GAG 

2330 2332 

ala gin asp leu tyr OP gr i ^ Tf 4'^) 

OCA CAG GAC CTC TAC TGA H ! f (00/77 OJ 
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